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Preface 


This book aims to provide an introductory yet solid background in stellar dynamics 
to astronomy students who later will attend more advanced courses (e.g., at the grad- 
uate level) and to researchers in other fields of astrophysics looking for an accessibile 
introduction to the subject. It should provide the reader with sufficiently broad (yet 
rigorous) coverage of some of the basic topics of stellar dynamics, opening the way 
to the study of specialized texts such as Dynamics of Galaxies (Bertin 2014), Galactic 
Dynamics (Binney and Tremaine 2008), Dynamics and Evolution of Galactic Nuclei 
(Merritt 2013), and Dynamical Evolution of Globular Clusters (Spitzer 1987), and to a 
fruitful reading of important reviews such as those of Binney (1982a), Cappellari (2016), 
and de Zeeuw and Franx (1991). However, it is assumed that the student already knows 
the basics of extragalactic astronomy (e.g., the classification, morphology, kinematics, and 
phenomenology of stellar systems such as open and globular clusters, galaxies, and clusters 
of galaxies); from this point of view, Galactic Astronomy (Binney and Merrifield 1998) 
and Spiral Galaxies (Bertin and Lin 1996) are excellent complementary works. From the 
mathematical point of view, a working knowledge of geometry, linear algebra, and calculus 
in one and several variables is required. 

This book does not cover the whole content of my courses (Extragalactic Astrophysics 
and Dynamics of Stellar Systems for third- and fourth-year undergraduate astronomy 
students) taught at the University of Bologna. In fact, as any judicious student knows by 
experience, no book can substitute for the understanding of a subject obtained from direct 
interaction with a teacher and from doing first-hand exercises (sometimes erroneously!) at 
the blackboard. I devoted some care to the addition of arguments and illustrative examples 
that are not usually found in other presentations but that I found quite effective in my classes 
for clarifying important conceptual points. Informal interactions with colleagues lead me 
to expect that this book will also be useful to professional astronomers not working directly 
on stellar dynamics who are looking for a simple but sufficiently rigorous exposition of 
the most important concepts and techniques used in the field. At the end of each chapter, 
the student will find worked-out exercises: they should not be considered optional, but as 
complementary material supporting the arguments discussed in the main text. The student 
will notice that several important topics of stellar dynamics are not discussed in detail, or 
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are not even mentioned at all; examples include instabilities and resonances, the evolution 
of collisional systems such as globular and open clusters, the kinematics and dynamics 
of spiral arms, the dynamical evolution of galaxy centers under the effects of single and 
multiple supermassive black holes, and the field of numerical simulations. The absence 
of these topics is motivated by the lack of sufficient expertise of the author, or because a 
proper treatment would be beyond the scope of this introductory book, or, finally (but not 
less importantly), simply due to personal taste. 

I am especially grateful to the many students who attended my classes over the years, 
both in Bologna and at the Scuola Normale Superiore of Pisa, where I taught in 1998—2003 
and 2006-2008, during which time I wrote a little collection of Lecture Notes on Stellar 
Dynamics (Ciotti 2000): their questions often helped to reveal weak points in my under- 
standing of the subject. Discussions and exchanges of opinions with several colleagues have 
also deeply influenced (directly or indirectly) the writing of this book. Among them, special 
acknowledgment is due to G. Bertin, J. Binney, A. D’Ercole, T. de Zeeuw, D. Lynden-Bell, 
J. Ostriker, S. Pellegrini, A. Renzini, R. Sancisi, M. Stiavelli, and T. S. van Albada. I also 
wish to thank W. Dehnen, W. Evans, O. Gerhard, J. Goodman, L. Hernquist, D. Merritt, 
L. Ossipkov, D. Pfenniger, E. Remiddi, D. Spergel, A. Toomre, S. Tremaine, and my 
coauthors and students, who are too numerous to be listed by name. Finally, B. Franchi, 
A. Parmeggiani, and M. C. Tesi of the Department of Mathematics of the University of 
Bologna are acknowledged for having been generous with their time spent on clarifying 
my understanding of particular mathematical issues; of course, I alone am responsible for 
any error in this book. 

V. Higgs, H. Cockburn, L. Edwards, E. Ferns, E. Migueliz, and the entire staff of 
Cambridge University Press are warmly acknowledged for their untiring patience and sup- 
port during all stages of the completion of this project. H. Niranjana as project manager 
and J. S. Marr as copyeditor are thanked for their work. 

On a more personal note, I warmly thank G. Bertin for his friendship: without his 
constant encouragement, this book simply would not exist. Finally, I wish to thank my wife, 
Silvia, and our daughters, Anna Magdalena and Sofia Elisabetta, for their patience and help 
over the years of writing: this book is dedicated to them, and to my parents, Marino and 
Enrica. 


La Scienza é ultima perfezione de la nostra anima. 
Dante Alighieri (1265-1321), Convivio, Trattato Primo, Capitolo Primo 


Part I 
Potential Theory 


1 
The Gravitational Field 


This chapter is aimed at introducing in an elementary yet rigorous way the mathematical 
properties of the Newtonian gravitational field — the basic entity — together with the Second 
Law of Dynamics, upon which stellar dynamics is founded. We begin from the gravitational 
field of a point mass, and then we move to consider the field of extended mass distributions 
by using the superposition principle. A direct proof of Newton’s first and second theorems 
for homogeneous shells is worked out, followed by a different derivation based on the 
Gauss theorem. 


1.1 The Gravitational Field of a Point Mass 
and of Extended Distributions 


The understanding of the structure, equilibrium, and dynamical evolution of stellar systems 
(to be understood in a broad sense, ranging from small galactic open clusters up to giant 
clusters of galaxies) in terms of the fundamental physical laws, and in particular of classical 
gravity, is the main subject of stellar dynamics. Stellar systems are immense when com- 
pared to the human scale, and so a sense of the astonishing values of the masses, lengths, 
and times involved is absolutely necessary, especially for students of sister disciplines 
such as mathematics or physics, who may lack a specific background in observational 
astronomy. 

So, let us start by considering our Sun, a yellow G-dwarf star (but not at all “dwarf” 
when compared to the masses and sizes of the vast majority of the N, * 3 x 10!! stars 
present in our galaxy, the Milky Way), with a mass of Mo ~ 1.98 x 10°? g and a radius 
of Ro ~ 7x 10!° cm. The shape of our galaxy is that of a quite flattened disk, with a 
radius of +26 kpc (1 kpc = 10° pe = 3.08 x 107! cm), a central “bulge.” and a disk 
thickness of *1 kpc. The Sun rotates around the galactic center on an almost circular orbit 
of radius ~8 kpc within the galactic disk. Detailed information about our galaxy and other 
stellar systems can be found in books such as Bertin (2014), Bertin and Lin (1996), Binney 
and Merrifield (1998), Binney and Tremaine (2008), Cimatti et al. (2019), and Sparke and 
Gallagher (2007). To appreciate these figures in their full glory, it is useful to construct a 
scaled-down model of the Milky Way, such as one in which the Sun is imagined as a little 
sphere of ~ 0.7 mm radius; in this model, all lengths are reduced by a factor of 107!?. 
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The Earth is now an invisible grain of dust, the radius of the Moon’s orbit is ~0.4 mm, 
while the Earth’s orbit is an (almost circular) ellipse with a semimajor axis of ~15 cm. 
The giant planets Jupiter, Saturn, Uranus, and Neptune rotate around the Sun at average 
distances of 0.78 m, 1.40 m, 2.87 m, and 4.50 m, respectively. On this scale, the nearest 
star (actually a multiple star, the Alpha Centauri system) is placed at ~41 km from our Sun, 
the thickness of the Milky Way disk is ~3 x 10+ km, and its diameter is +1.6 x 10° km! 
The Sun would revolve around the galactic center on a roughly circular orbit with a radius 
of 2.5 x 10° km at a velocity of 2.1 x 1074 mm/s (i.e., ©0.8 mm/hr) and a period! of 
230 Myr (corresponding to a physical circular velocity of ~220 km/s). 

Therefore, it should not be surprising that, with extremely good approximation, in most 
(but not all) applications of stellar dynamics, stars can be considered point masses (see 
also Exercise 1.1). We then start our study quite naturally by considering the gravitational 
field of a point mass. Notice that the relevance of this case goes well beyond the point- 
mass approximation of stars in galaxies because, as the gravitational field produced by a 
generic mass distribution is given by the sum of the fields produced by each of its parts, 
the point masses that will be used in the next two chapters can also be interpreted as atoms 
or molecules when computing the gravitational field produced by a macroscopic material 
object! 

The starting point of Newtonian gravity is the definition of the gravitational field g, at 
position x, produced by a material point of mass m at position y in some reference system 
So. Empirically, it is found that the field is radial, with 
x—y 


g(x) = —Gm_—,, 
Ix — yl 


(1.1) 
where G ~ 6.67x 107% cm?/s*g is the universal gravitational constant, ||... || 
/(.., ++.) is the standard Euclidean norm, and (, ) is the usual inner product over 
2? (see Appendix A.1); from now on we indicate vectors with bold letters, if not stated 
otherwise. Note that the field is not defined at the particle position and that g(x) depends on 
time through the position of the particle y(t): the classical gravitational field “propagates” 
instantaneously over all of the space. Equation (1.1), formally identical to that describing 
the electrostatic field produced by a point charge, encapsulates the two most important 
properties of the gravitational field of a point mass (1.e., the fact that the modulus of the field 
decreases radially as the inverse of the square distance from the particle and the fact that 
the force between two particles is attractive). The third fundamental property of classical 
gravity, confirmed by an enormous body of experimental evidence (a point unfortunately 
not always sufficiently stressed; however, see e.g. Feynman et al. 1977 among the notable 
exceptions), is the superposition principle (i.e., the fact that the gravitational field produced 
at the point x by two point masses of masses m, and mp placed at y; and y2 is the vector 


1 For order-of-magnitude estimates, it is useful to recall that 1 yr ~ 2 x 107 s, and that a velocity of | km/s corresponds to 
~1 pe/Myr. 
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sum g1(x) + go(x) of the two fields). As a matter of fact, the whole theory of classical 
gravitation can be built from Eq. (1.1) and the superposition principle. 

For example, the gravitational field produced at x by an extended mass distribution of 
density p(y), such as a star, a planet, or a galaxy, can be immediately written as 


1 a= y 3 

gx) =-G] ——Jely)d’y, (1.2) 
x IIx — yll 

where p(y)d*y is the mass element in the infinitesimal integration volume d*y and the 

integral (a sum) embodies the superposition principle. Of course, in the special case of N 

point masses of mass m; and position x;, one can define 


N 


ply) = >) mjs(y — xj), (1.3) 


i=l 


where 6 is the so-called Dirac 5-function (actually a distribution; see Appendix A.2.5), and 
so Eq. (1.2) reduces to a standard sum, as expected from Eq. (1.1). 

The student should appreciate that, in principle, Eq. (1.2) contains all of the informa- 
tion we need to determine the gravitational field of a given density distribution. In other 
words, Eq. (1.2) is the general solution of the problem of the calculation of the Newtonian 
gravitational field produced by an assigned mass distribution o: one could conclude that 
the only problem to be addressed is just how to calculate (analytically or numerically) the 
integral (1.2) in all of the cases of interest. However, this conclusion would be very wrong. 
In fact, the most profound properties of the gravitational field cannot be derived by simple 
evaluation of Eq. (1.2), and (as is common in physics) the defining equations of a problem 
invariably contain much more information than the solution itself. For these reasons, we 
now start from the “solution” given by Eqs. (1.1) and (1.2), and we look for the differential 
equations leading to this solution. Significant effort will then be spent in the study of the 
properties and implications of the obtained equations, and this effort will be repaid by the 
discovery of powerful mathematical methods that will lead us to a deep understanding of 
the gravitational field produced by mass distributions and, finally, as a useful by-product, 
to general techniques for the evaluation of Eq. (1.2). 


1.2 Newton’s First and Second Theorems 


One of Newton’s major accomplishments was the discovery of the first and second theorems 
concerning the gravitational fields produced by spherical and homogeneous material shells. 
Usually, the two theorems are proved by using the Gauss divergence theorem, as we will 
also do in Section 1.3. However, due to their importance, here we prove them from direct 
integration of Eq. (1.2), in the original spirit (while avoiding the subtleties of Newton’s 
awe-inspiring geometric proof; e.g., see Binney and Tremaine 2008; Chandrasekhar 1995). 

The first step is to show that the gravitational field produced by a generic spherical 
density distribution p(r) is radial. This is accomplished by arbitrarily fixing a point of 
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space x, by defining the direction of the z-axis so that” x = (0,0,7), and finally by changing 
the integration variables from Cartesian to spherical in Eq. (1.2). Integration over the angles 
(do it!) proves that the resulting field g is directed along the z-axis (i.e., along x) and 
hence is radial. The second step is to specialize the density distribution p to the case of 
a homogeneous and infinitesimally thin shell of radius R and total mass M, so that from 
Eq. (A.97) 

Mé(r — R) 
"Are? 
From the radial symmetry of the field, we can proceed to the explicit integration of the 
z-component of g at x = (0,0,r), and after some algebra (see also Exercise 1.2), we finally 
obtain 


ply) 


r= llyl. (1.4) 


(x) = 1+ — : f, ZR (1.5) 
x)= eh. OF : . 
6 2r2 Ir — R| 


where f, is the radial unit vector in spherical coordinates (see Appendix A.8). It follows that 
for r < R no field is present (Newton’s first theorem), while for > R the field coincides 
with that produced by a material point of mass M, placed at the origin (Newton’s second 
theorem). 

A somewhat delicate situation arises when considering the field produced by the shell 
on itself (i.e., atr = R). In fact, it is clear from Eq. (1.5) that the function g(x), evaluated as 
a limit for r — R, is discontinuous, with different left and right values. A natural question 
that arises is how to compute the field on the shell itself. Naively, a “reasonable” approach 
could be to fix r = R before performing the integral over the surface of the shell and to 
pretend that for symmetry reasons the integral over the azimuthal angle ¢ is to be performed 
before the integral over the colatitude 3%. With this approach, we deduce that a point on the 
shell experiences a radial force per unit mass given by 


GM 
B(x) = —Saaf. (1.6) 


the average value of Eq. (1.5) just inside and just outside the shell. However, a closer 
inspection of the integral reveals a more delicate situation (i.e., that the tangential compo- 
nent of g to the shell cannot be uniquely defined). For example, the student is encouraged to 
calculate the tangential component of g(0,0, R) with a different order of integration from 
the “natural” one (i.e., by fixing the angle @) and integrating first over O < 0 < m: the 
tangential component of the field now diverges for } — 0 (i.e., for the effect of the points 
of the material meridian line touching the point x). Technically, the problem is due to the 
fact that, in the present case, the integral in Eq. (1.2) is not absolutely convergent, and 
the result depends on how the integral is computed. Here, the lesson is that, as a safety 


2 For simplicity, vectors are represented in the text as row vectors. See also Footnote | in Appendix A. 
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rule, symmetry arguments should be used to evaluate integrals only after the integrals are 
known to converge. 


1.3 The Gauss Theorem and the Gravitational Field 


An alternative and more elegant derivation of Newton’s first and second theorems can be 
obtained by using the Gauss divergence theorem (see Appendix A.4). However, before 
addressing this problem in Section 1.3.1, we must establish a few fundamental properties 
of the gravitational field. 

We start by considering the application of the Gauss theorem to the gravitational field of 
a point mass, and so we evaluate first the divergence of the gravitational field in Eq. (1.1). 
It is a simple exercise (do it!) to show that for all x # y, the divergence of the field 
at x vanishes (the student may also wish to prove, by using the divergence operator in 
spherical coordinates, that the radial 1/r? field is the only radial field in 9t° with this 
property). Therefore, from the Gauss theorem, it follows that the flux of the gravitational 
field produced by a point mass across a generic closed surface 0Q, not containing the point 
mass, is zero. From the superposition principle, this conclusion immediately generalizes to 
the case of the flux of the field g produced by an arbitrary mass distribution placed outside 
the closed surface 0&2. 

Now, let us ask what happens to the flux if the point mass is inside the region Q. 
Clearly, being divg = 0 for x ¥ y, according to Eq. (A.112), the only possible nonzero 
contribution of the divergence to the flux integral can be due to what happens at the point 
x = y. We face two problems here. The first is that we cannot compute div g for x = y, 
simply because the field is not defined there. The second is even more worrisome because 
from a naive interpretation of integration theory, one could expect that the volume integral 
evaluates to zero even if we pretend that the “value” of div g at x = y is infinite, being a 
point of a set of zero measure. This would be in stark contrast to the fact that at this stage 
we can certainly imagine a region containing the point mass with a shape such that the total 
flux is strictly negative! The only logical conclusion is that the flux is not zero and that 
integration theory is (obviously) correct, but that div g for x = y is a more “complicated” 
object than a function with infinite value: we now show that this is the case. The idea is 
borrowed from complex analysis, and it substitutes into our problem a different problem 
that we can solve. So, as is shown in Figure 1.1, we remove from Q a spherical region 
of radius R centered on y, and we produce a two-dimensional “cut” (the dashed line c in 
Figure 1.1) connecting the surface of the inner hole with the external surface 002. With 
the exclusion of the spherical hole, our particle is now outside the resulting region, so that 


3 Perhaps the most famous such case is encountered in the computation of the Jeans mass (e.g., see Binney and Tremaine 2008; 
Shu 1992), when fixing to zero (the so-called Jeans swindle) the gravitational field produced by the unperturbed, infinite, and 
homogeneous three-dimensional density background. In fact, instead of integrating over spherical shells centered on the point 
of interest (with the result of a zero field from Newton’s first theorem), we could integrate over parallel density slabs of unitary 
thickness, at the left and right of the point, obtaining a conditionally convergent alternating series (do it!); see also Exercise 1.8 
for another example. A full discussion of the conditions for the existence of integral (1.2) can be found in Kellogg (1953). 
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Figure 1.1 Schematic illustration of how, by using the divergence theorem, we can prove that the 
flux of the gravitational (or electrostatic) field of a particle of mass m (or charge q) contained in a 
closed and regular (but otherwise arbitrary) region Q C 23 is —4 Gm (or q/€), independently of 
the position of the particle inside the volume. The dashed line indicates a generic two-dimensional 
“cut” with two opposite normals, one for each of the two sides of the cut. 


the flux of g through the total boundary (made by the original 0&2, by the surface of the 
spherical hole, and by the two geometrically coincident but analytically distinct surfaces of 
the cut c) evaluates to zero. Therefore, from the divergence theorem, the flux of g through 
dQ is simply the negative of the flux on the inner spherical hole, because the flux on the 
two surfaces of the cut cancels out. By reversing the normal at the surface of the hole and 
performing the surface integral by exploiting the spherical symmetry of g, it is now simple 
to show (do it!) that the flux on the hole,* and so onto 9Q, is —47.Gm. 

In conclusion, by using the Gauss theorem applied to the special case of the three- 
dimensional radial 1/r? field, we showed that for a point mass 


/ div g(x)d°x = / (g(x),n)d?x = —42Gm x Bs ee (1.7) 
Q 


aQ 1 yea 


In other words, div g behaves as the three-dimensional Dirac 6-function in Eq. (A.95) 
(technically a distribution), and we can formally write 


4 Quite obviously, even before computing it, we knew that the flux on the spherical hole is independent of the adopted value of 
R, which is consistent with the fact that the flux on @Q cannot depend on our arbitrary choice of R! 
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ees, 
IIx — yl? 


where the subscript indicates the coordinates used to compute the divergence. From the 


= 47 5(x— y), (1.8) 


div x 


superposition principle, we now conclude that, given an arbitrary mass distribution p(x) 
and an arbitrary closed (and sufficiently regular) region Q C %°, the flux of g produced 
by p through the boundary 0Q is —42%71GM, where M = ie pd°x is the mass contained 
in the volume. Highlighting an elementary but (perhaps) not always appreciated point is 
in order here. In fact, it is clear that a null flux over a closed surface 0Q does not imply a 
null field inside the region Q, as is obvious for the case of a point mass external to a given 
volume. In turn, this also means that, in general, one cannot prove that the gravitational 
(or electrostatic) field inside an empty cavity is zero simply by considering the null flux 
through some closed surface contained in the cavity. A naturally related question is then 
why inside an electrically charged, conducting, and closed surface of arbitrary shape at 
equilibrium not only the flux but also the field is zero (the so-called Faraday’s cage), while 
inside a material surface of similar shape the gravitational field is not zero, even though the 
force law in the two cases is mathematically identical. To properly answer this question, we 
must move to the next chapter, where we encounter the concept of gravitational potential, 
which is fundamental in stellar dynamics. 

We will now use Eq. (1.8) to obtain the differential equation relating the gravitational 
field of a mass distribution to its density. In practice, we compute? the divergence under the 
sign of the integral of Eq. (1.2), and from Eq. (1.8), we obtain the fundamental identity 


div g(x) = —47 Ge(x). (1.9) 


If we now insert into Eq. (1.9) the expression for the field of a point mass from Eq. (1.1), 
we deduce that 


p(x) = md(x— y), (1.10) 


or, in other words, that physically the Dirac 5-function can be imagined as the “density 
distribution” of a point mass. This shows in the most apparent way that the 5-function is 
a dimensional object with the inverse volume units of the space under consideration, and 
that, from a mathematical point of view, a point mass is “more” than just a simple point of 
“infinite density.” 


1.3.1 Newton’s First and Second Theorems Again 


We are now in the position to prove again, using a different approach, Newton’s first and 
second theorems, and to show the power of the Gauss theorem in Eq. (1.7). In fact, from 
the property that the gravitational field of a spherically symmetric mass distribution p(r) is 


5 Of course, this step can be rigorously justified (e.g., see Kellogg 1953). 
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radial (i.e., g = g,(r)f,-; see Section 1.2), and using a spherical control volume of radius r 
centered on the origin, Eq. (1.7) reduces immediately to 


4nr?g,(r) =—4nGM(r), M(r) = 40 is p(t)t7dt, (1.11) 
0 


where the (still unknown) component g, has been “factorized” out of the integral thanks 
to its geometric properties and to a wise choice for the integration surface. In practice, the 
gravitational (or electrostatic) field of a spherical mass (charge) distribution at distance r 
from the center is the same field of a point mass (charge) of mass M(r) or charge Q(r). 
Newton’s two theorems are immediately recovered from Eq. (1.11) for the case of p given 
the homogeneous shell in Eq. (1.4). The student is encouraged to follow the same line 
of reasoning and deduce the expressions analogous to Eq. (1.11) for cylindrical densities 
e(R) and planar densities p(z), taking into account the precautionary notes at the end of 
Section 1.2. 

A final warning to enthusiastic students: unfortunately, the powerful Gauss theorem 
is not a magic tool that can, elegantly and effortlessly, give the gravitational field of an 
arbitrary mass distribution while sparing the tedious work of integration. In fact, even 
though the Gauss theorem holds for arbitrary volumes and mass distributions, it should 
be clear that the derived expressions for g of spherical, cylindrical, and planar distributions 
are based on the exceptional circumstance that the geometry of the surfaces over which the 
flux integrand can be factorized is known before g is actually calculated. For generic mass 
distributions, the geometric properties of g are not known in advance, such that the question 
of what kind of surface one could/should use to repeat the treatment of special geometries 
is as difficult as the problem of determining g itself! 


Exercises 


1.1 With this exercise, we quantify the possibility of considering stars as point masses 
in real stellar systems. Consider an idealized stellar system of radius R and homoge- 
neously filled with N identical stars of radius Ro. We define a geometric collision 
as the situation realized when the centers of two stars are separated by a distance 
d < 2Ro. By considering the co-volume of N “cylinders” of length A (a rough 
estimate of the mean free path; e.g., see Born 1969) and radius 2Ro (why?), argue 
that an estimate of 4 can be obtained by imposing that the total volume of the 
cylinders equals the volume of the sphere, i.e., formally 


a = - (1.12) 
2R 6NR4, , 
Estimate A for an elliptical galaxy and for a globular cluster. 
1.2 Let 
x—y 
g(x) = ia Pa a < 3, (1.13) 
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a natural generalization of the classical gravitational field produced by a particle of 
mass m, where G is some universal constant (e.g., see Di Cintio and Ciotti 2011). 
Show that the radial component of the gravitational field produced at radius r by the 
homogeneous spherical shell of total mass M and radius R given in Eq. (1.4) is 


(y=- 24 7 (1.14) 
&&V) =—75 * p37 _ p37 pi-@ = piv . 
cae) (oe ee ee a 
3-a a-—l 
where r+ = r+ R and r_ = |r — R|. Discuss the relevant case of a = —1 (the 


harmonic oscillator) and show that for a = 2 we reobtain Eq. (1.5). What happens to 
the field atr = R? 

For x ¥ y, calculate the divergence of the field given in Eqs. (1.13) and (1.14) and 
discuss the result as a function of @ by using the Gauss theorem applied to concentric 
spherical control surfaces of increasing radius. Explain geometrically why inside a 
uniform shell a mass point is attracted toward the shell for @ > 2 and toward the 
center for a < 2. 

Consider a homogeneous sphere of constant density oo9, and from Eq. (1.11) show 
that the field inside the sphere is that of the three-dimensional isotropic harmonic 
oscillator 


4a Gpo 
3 


g(r) =— r. (1.15) 
By using the superposition principle and the Newton’s second theorem, determine 
the field inside a spherical hole — not necessarily concentric — carved in the sphere: 
isn’t the result beautiful? Hint: Imagine the hole is produced by the superposition of 
a sphere of negative density —o and calculate its repulsive field from Eq. (1.15). 
Point particles, rings, and disks are often encountered in problems of stellar dynam- 
ics. Prove that the three-dimensional density of a particle of mass m placed on the 
z-axis at distance a from the origin, in spherical and cylindrical coordinates, can be 
written respectively as 
d(r —a)d(P) _ nok —a)d(R) 


= = 1.16 
ani 2nrr2 sind 2nR Ute) 


Moreover, show that for a homogeneous ring of total mass M, radius a, linear density 
A. = M/(2ra), placed on the z = 0 plane, and with the center at the origin, 


_ uit — a)d(0 — 1/2) _ yok — a)6(z) 


= 1.17 
2rr2 sind 2nR ny) 


Finally, show that for a razor-thin disk of surface density &(R) on the z = 0 plane, 


= sip? @ = — scMse, (1.18) 
r sind? 
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1.7 


1.8 
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so that from Eqs. (1.17) and (1.18) it follows that the “surface density” of the homo- 
geneous ring can be written as 


Mé(r — a) MS5(R —a) 

Ling = nr = eT ee (1.19) 
What happens to Eq. (1.7) in the special case of the point mass placed on the bound- 
ary 082? Hint: Suppose the tangent plane to dQ exists at the particle’s position. Draw 
a little semisphere of radius € centered on the particle and use Eq. (1.7) to evaluate 
the total flux on the new resulting surface made by 0Q, minus the disk of radius e, 
plus the semisphere of radius €. Conclude that, no matter whether the semisphere 
includes or excludes the particle, the limit of the total flux through dQ for « — 0 is 
—2Gm, so that from Eq. (1.8) it follows that integration of the Dirac 6-function on 
a regular point of a boundary of a closed volume evaluates to 1/2. 
By using Newton’s second theorem and Eq. (1.2) twice, prove that two nonoverlap- 
ping spheres of masses M, and M> and centers at x; and x» attract each other as two 
material points of masses M, and M2 placed at x; and xp. 
This problem illustrates the danger of the naive use of symmetry arguments when 
calculating integrals. It is commonly said that the gravitational (or electrostatic) field 
of a homogeneous, infinite, razor-thin density distribution of surface density o is 
constant and directed normally to the plane “because the field components parallel 
to the plane vanish by obvious symmetry arguments.” Actually, it is easy to prove 
that within Newtonian gravity (or electrostatics) the horizontal field is undetermined 
(see also Footnote 3). Consider in the (x, y) plane four material squares of surface 
density o, with a common vertex at the origin. Let a be the length of the side of the 
square in the first quadrant, and by direct integration show that the gravitational field 
components at the point (0,0, z) above the origin are 


a a 
= gy = Go | arcsinh— — arcsinh ——— }, 1.20 
tx = 8) = Go (asin Ta) — 
where arcsinh(x) = In(x + V1 + x2), and 
a 
8: = —Go arctan————., (1.21) 
: za? + 27 


(e.g., see Kellogg 1953; McMillan 1958). Consider now the behavior of the total 
field g produced by the four squares of side length a, b, c, and d, when the sides are 
independently extended to infinity. Show that g, reduces to the value obtained from 
the Gauss theorem. What happens to the tangential components? 

As is well known, Newton’s first theorem can be seen as the result of a perfect 
cancellation at a generic point inside a homogeneous spherical shell of the r~* force 
produced by the two infinitesimal mass elements determined by the intersection of 
the shell itself and the two sides of the conus with an infinitesimal opening angle 
and a vertex in the considered point, followed by a rotation of the conus’s axis 
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over the whole solid angle (e.g., see Binney and Tremaine 2008; Chandrasekhar 
1995). Repeat this wonderful geometric argument for the case of a generic point 
inside a homogenous ring and conclude that the resulting attractive force is directly 
radially toward the nearest point of the ring (see also Exercise 2.33): therefore, when 
computing the gravitational field of a disk, its external regions cannot be ignored, a 
fact of great importance for the interpretation of the rotation of disk galaxies. 


2 


The Gravitational Potential 


In this second introductory chapter, the concept of gravitational potential is presented and 
then developed up to the level usually encountered in applications of stellar dynamics, such 
as the computation of the gravitational fields of disks and heterogeneous triaxial ellipsoids, 
the construction of the far-field multipole expansion of the gravitational field of generic 
mass distributions, and finally the expansion in orthogonal functions of the Green function 
for the Laplace operator. 


2.1 The Gravitational Potential 


In Chapter 1, we obtained the integral expression for the gravitational field g produced 
by a generic mass distribution, and we derived important information about its properties 
by considering the Gauss divergence theorem jointly with the superposition principle. In a 
certain sense, we could content ourselves with these results that in principle would allow 
us to compute (e.g., by using the powerful numerical techniques and computers available 
today) the gravitational fields of astronomical objects such as galaxies, stars, and planets. 
However, this “brute force” approach would lead almost immediately to a conceptual dead 
end: yes, we can certainly compute the gravitational field of each specific object with the 
desired accuracy, yet a full grasp of the general properties of the gravitational field would be 
precluded. As already stressed in Chapter 1, it is worth recalling that in physics we usually 
learn about a problem as much from the equations as from the solution, and accordingly 
we will now look at the mathematical problem behind the solution given by Eq. (1.2). 
Therefore, we repeat the approach in Section 1.3, where the results have been obtained 
by considering the divergence (div) of field g of a point mass, focusing now on the effects 
on g of the rotor (also known as the curl or rot) when applied to g, as well as on the 
consequences that can be derived from the Stokes theorem (Appendix A.4). First, however, 
it is important to answer a question that quite often arises in discussions with students: Why 
are div and rot so special that even the Maxwell equations for the electric and magnetic 
fields are written in terms of these two operators (e.g., Feynman et al. 1977; Jackson 1998)? 
We could certainly invent other differential operators at our leisure! The reason for this is 
elucidated by the Helmholtz decomposition theorem (see Appendix A.6 for more details). 
Here, it suffices to recall that this theorem states that in three-dimensional space every 
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reasonably well-behaved vector field can be almost uniquely! represented as the sum of two 
vector fields, one with null divergence (or solenoidal; 1.e., in a simply connected domain 
this component can be written as the rotor of a vector potential) and the other with null 
rotor (or irrotational or closed; i.e., in a simply connected domain it can be written as the 
gradient of a scalar potential, and so this second component is not only closed, but also 
exact). In practice, the Helmholtz theorem ensures that, when the divergence and the rotor 
of a vector field are known, then we know “everything” about the field itself. 

Therefore, we now consider the rotor of the gravitational field g produced by a generic 
density distribution p(x). From Eq. (1.1), it follows that for a point mass rot g = O for 
x # y, because the rotor of a radial field vanishes (prove it!); moreover, St? minus a point 
is simply connected, and so the field produced by a point mass is not only closed, but also 
exact (see Appendix A.4). Without loss of generality, for the field of a point mass we can 
then write 


g=-V69, (2.1) 


where (x) is the gravitational potential, and the minus sign in front of it is a standard 
convention in physics. The superposition principle now guarantees that Eq. (2.1) also holds 
for the gravitational field of a generic density distribution given in Eq. (1.2) (i.e., that the 
potential can always be defined once g exists); alternatively, the student could evaluate 
the rotor under the integral and show that in fact the rotor vanishes (i.e., that g is closed). 
Having established the existence of the potential, now the question is how to obtain the 
explicit expression of @(x) for an assigned p(x). Following the rules of vector analysis in 
Appendix A.4, 


(9) ~ 6000) =~ f (g(x), dx), (2.2) 


y (Xo, X) 
where Xg is an arbitrary point of space (with the only obvious condition that g is defined 
there), and y(xo,x) is a well-behaved but otherwise arbitrary path connecting the point 
xo to the point x. The student will surely note that the potential actually depends on two 
arbitrary choices (i.e., the position xo and the value of the gravitational potential at xq). For 
example, in the case of the point mass placed at y, the integration path in Figure 2.1 shows 
that the most general expression of the potential is 


Gm Gm 
IIxo—yll = Ix—yll’ 
The usual formula for the potential of a point mass is then recovered by fixing x9 = oo and 


(xo) = 0. 
Along the same lines, we can finally show that in the case of extended mass distributions 
the field is not only closed, but also exact (as expected from the superposition principle), 


p(x) = $(Xo) + 


xo #Y. (2.3) 


! The decomposition is almost unique because it is possible to add to the decomposition the gradient of a generic harmonic 
function (a gauge) without changing the divergence and the rotor of the resulting field. However, if the vector field is also 
assigned on a boundary 0Q of some prescribed (simply connected) volume Q (the Dirichelet problem in Figure 2.2), the 
decomposition is unique. 
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Figure 2.1 The integration procedure used to establish Eq. (2.3). The two points xg and x are 
connected by Path | from xo to Xp on the surface of the sphere centered on y and of radius ||xo — y]l, 
followed by the radial Path 2 from Xp to x’ aligned with the origin O and the position y of the point 
of mass m, and finally by Path 3 from x’ to x on the surface of the sphere centered on y and of 
radius ||x — y||. The integrals along Paths 1 and 3 are zero because (g,dx) = 0 by construction. 
For a generic spherical mass distribution centered at the origin, an identical treatment shows that 
d(r) = (79) - i gr (r’)dr', where g; is given by Eq. (1.11), so that Eq. (2.5) is finally proved. 


simply by constructing the function #(x). This is done by using Eqs. (1.2) and (2.2), 
exchanging the order between line and volume integration, and performing the path inte- 
gration as in the case of the point mass: 


1 1 i 
( ) ply)dvy. (2.4) 


(x) = 600) —G | 
kyl Toy 


x3 
Notice that if the density distribution decreases faster than ||y||~? for |ly|]| —> oo (the 
common situation of systems of finite total mass, when p decreases faster than |ly||~°, 
is contained in this case), it is possible to fix x9 = oo and @(xo) = 0. However, in stellar 
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dynamics, sometimes idealized stellar systems of infinite total mass, not included in the 
previous family, are encountered (e.g., see Exercise 2.1 and Chapter 13 for relevant exam- 
ples). In these special cases, x9 cannot be placed at infinity: however, if one expands the 
integrand in Eq. (2.4) for ||y|| — oo with xo at some finite distance from the origin, a 
cancellation of the leading term occurs and convergence is reestablished even for density 
distributions decreasing faster than ||y||~! for ||y|]| + 00 (prove it!). 

We conclude this section by proving a particularly important (and simple) formula for 
the potential of spherically symmetric mass distributions centered on the origin. By using 
again the approach depicted in Figure 2.1 from Newton’s second theorem, and after an 
exchange in the order of integration, one gets 


o(r) = d(ro) + . ae om arc | p(t)tdt, (2.5) 


r 


where M(r) = 47 te p(t)t?dt is the total mass contained in a sphere of radius r. The 
formula becomes particularly simple in the case of systems for which the limit M(r9)/ro 
is finite for rg — 00, so that the sum of the first two quantities on the right-hand side of 
Eq. (2.5) can be set equal to zero by considering r9 = oo and @(00) = lim,)_.59 GM (r)/To. 
Finally, it can be useful to remind the student that Newton’s second theorem applies to the 
force, not to the potential (i.e., the potential at distance r from the center of a spherical 
mass distribution is not the potential of a point of mass M(r) placed at the origin unless 
p(r) is truncated and r is outside the mass distribution itself). 


2.1.1 The Poisson Equation 


We can now combine Eqs. (1.9) and (2.1) and obtain one of the most important equations 
in physics, the Poisson equation 


Ad (x) = 47 Gor (x), (2.6) 


where A = divgrad is the so-called Laplacian (Appendix A.4); incidentally, by con- 
sidering the special case of the point mass, from Eqs. (1.10) and (2.6) we recover the 
correspondent of Eq. (1.8); in other words, we can prove that in i? 


Ax : = —475(x — y), (2.7) 
IIx — yll 
a fundamental identity in physics and mathematics. We will come back to this identity in 
Section 2.4. 
As we will see in the next section, from Helmholtz’s theorem the Poisson equation, when 
supplemented with the appropriate boundary conditions,” encapsulates all of the properties 
of the gravitational field, and it can be considered as the field equation of classical gravity 


2 Itis elementary to realize the importance of boundary conditions: for a given pair (p,), the Poisson equation remains 
unchanged if the harmonic function (Ax, x) + (b,x) + c (with A being a3 x 3 matrix with TrA = 0, b being a constant vector, 
and c being a constant scalar) is added to ¢, but g = —V@ would be different! 
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(and of electrostatics). In an empty space, Eq. (2.6) is called the Laplace equation, and its 
solutions are called harmonic functions. As anticipated, we can learn much more about g 
from inspection of Eq. (2.6) than from its solution given by Eq. (1.2). 


2.1.1.1 The Gravitational Field in Cavities 


We now show how Green’s first identity (Appendix A.5) can be used to prove that the 
solution of the Poisson equation (2.6) in a given simply connected region Q C %°, with 
assigned dependence (x) = h(x) on the boundary dQ (the Dirichelet problem; see 
Figure 2.2), if it exists, is unique. In fact, suppose that two solutions, ¢; and $2, exist for 
the problem. Then, due to the linearity of the Laplacian, their difference y = $1 — ¢2 
is a solution of the Laplace equation inside the volume, with null boundary conditions, 
because ¢; = $2 = fA on OQ. From the vanishing of the surface integral in Eq. (A.127) 
with f = g = ¥, it follows that ||Vy|| = 0 on Q (i.e., w = const. inside the volume). But 
w = 0 on 0Q, and so y = 0 everywhere on Q, and finally 6; = $2. Notice that a similar 
(but not identical) result is obtained if, instead of @, its normal derivative is assigned on 
dQ2 (the Neumann problem): in fact, in this case the surface integral in the Green identity 
vanishes; however, we can now only conclude that yy = const. (and so ¢2 = ¢1 + const.) 
on &2, as expected because only the normal derivative of ¢@ on dQ is assigned, but not the 
value of @. Finally, note that even if the Laplace operator is of second order, the unicity 
result proves that in general it is not possible to assign both the value of ¢ and its normal 
derivative on dQ, illustrating how in general boundary value problems are different from 
initial value Cauchy problems (where for a second-order operator both the function and its 
derivative must be assigned). 

A fundamental consequence of the unicity theorem is now apparent: suppose we have 
an empty region of space, bounded by a surface density coincident with an isopotential 
surface. Surely, @ = const. is a solution for the Laplace problem, and the unicity theorem 
proves that this is the solution (i.e., inside an empty cavity bounded by an isopotential 
surface density distribution, the gravitational field produced by the density distribution 
is zero). We therefore obtained a third proof of Newton’s first theorem as a spherical 
homogeneous shell of matter that coincides with an isopotential surface. It should now be 
clear why the gravitational field inside a closed cavity bounded by an infinitesimally thin 
density distribution o is not zero (even if the flux over all of the closed surfaces contained 
inside the cavity is zero from the Gauss theorem), except for very special geometries 
(e.g., the homogeneous spherical shell, or, with the precautionary notes in Chapter 1, 
inside a homogeneous and infinite cylindrical shell, or between two infinite and parallel 
homogeneous planes with the same surface density), while inside a cavity of arbitrary 
shape carved in a perfect conductor at equilibrium, the electrostatic field is always zero, 
independent of the shape of the cavity. This because in general a given surface material 
density does not coincide with an equipotential surface, while in the case of perfect conduc- 
tors (when charges can freely move until equilibrium is reached in the tangential direction 
to the surface) the bounding surface is necessarily an equipotential, even in presence of 
external charges (the so-called Faraday’s cage; see the beautiful discussion of the problem 
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Neumann 


—Q 


b = h(x) 
Dirichelet 


Figure 2.2 The Dirichelet and Neumann problems for the Poisson equation (2.6). As usual, we 
indicate the normal derivative of ¢ on 0Q with d¢/dn = (n,V®@). As explained in the text, in the 
Dirichelet problem if the solution exists, it is unique, while in the Neumann problem the solution is 
unique up to the addition of a constant. 


in volume II of Feynman et al. 1977). The interested student is invited to meditate on the 
following question: We reached the conclusion that a conducting surface shields the region 
inside from the electrostatic field of an external charge. However, if we put a charge inside 
a conducting surface, the field outside cannot be zero from the Gauss theorem. Why does 
the equipotential surface distribution of charges not shield the external region from the 
charges inside? After all, isn’t the exterior space a region “bounded at the bottom” by an 
equipotential? 


2.2 Newton’s Third Theorem 


Considering the deep nature of the unicity theorem, the fact that Newton was able to 
imagine and then prove by means of pure geometry the theorem that will now be illustrated 
is evident proof of his immense genius. This theorem concerns the gravitational fields 
produced by ellipsoidal figures, and it has far-reaching applications in stellar dynamics 
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(as stellar systems are often well described by ellipsoidal shapes, with the paradigmatic case 
of elliptical galaxies; e.g., see Chandrasekhar 1969), for rotating fluid equilibria (e.g., see 
Lamb 1945; Tassoul 1978), and in classical electrodynamics (e.g., see Becker 1982). For 
these reasons, Newton’s third theorem deserves a thorough discussion, and in the following 
we provide a fairly complete analytical proof of the theorem. 

The special geometric object considered at the outset is the so-called finite-thickness 
homeoid. Such an object is defined as the region of space m; < S(x) < mz contained 
between two concentric, coaxial, and similar ellipsoids described by the function 


3 
Sx) =, a >a > as; (2.8) 
i 


sometimes in the following we will assume x = x1, y = x2, Z = X3, and that of course one 
should distinguish between the function S(x) and the ellipsoidal surface that is obtained by 
setting S(x) = m. A finite-thickness homeoid filled with matter of uniform density po will 
be called a finite-thickness homogeneous homeoid. 

Newton’s third theorem states that the gravitational field produced by a finite-thickness 
homogeneous homeoid inside its cavity S < my, is null. Different proofs can be found in 
the literature (e.g., see Becker 1982; Binney and Tremaine 2008; Boccaletti and Pucacco 
1996; Chandrasekhar 1969; Kellogg 1953; McMillan 1958; Routh 1922), and here we will 
follow the approach based on ellipsoidal coordinates and on the use of the Gauss and unicity 
theorems. However, before going through the proof, it is useful to assume the truth of 
the theorem and to derive a few important consequences of it when combined with the 
superposition principle. First, if we define the finite heterogeneous homeoid as the union 
of homogeneous homeoids of density p = p; for mj < S(k) < mj41;, andO < m, < 
m2 < +++, we immediately deduce that the field produced in the cavity S(x) < my, is 
zero. Second, as nothing is said about the “thickness” of the regions mj < S(x) < mj+1, 
it is clear that the previous conclusion also holds when m is a continuous variable and 
p = p(m): we will refer to this object, which is of great importance in stellar dynamics, as 
the heterogeneous ellipsoid. Finally, if we consider an arbitrary point xo inside a heteroge- 
neous ellipsoid p(m), it also follows that g(xo) is determined only by the density distribu- 
tion enclosed by the ellipsoidal surface passing thorough Xo (i.e., for S(x) < S(xo) = mo, 
a wonderful generalization of Newton’s first theorem). 

A very elegant way to prove Newton’s third theorem, up to the level of obtaining the gen- 
eral expression for the gravitational potential of a heterogeneous ellipsoid, is to adopt ellip- 
soidal coordinates (see Appendix A.8; see also Bertin 2014; Binney and Tremaine 2008; 
Boccaletti and Pucacco 1996; de Zeeuw and Lynden-Bell 1985 and references therein). 
These coordinates, for the generic point x = (x, y, z), are defined by the three solutions for 
t of the cubic equation in t 


3 x? 
> - =f, (2.9) 


i=l qj ne 
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whose relation with Eq. (2.8) is apparent. Notice that for t = 0 the left-hand side of 
Eq. (2.9) reduces to the unitary ellipsoidal surface S(x) = 1; therefore, for a point on the 
unitary surface, t = 0 is one of the three solutions. It is an interesting exercise (do it!) to 
show that for arbitrary x the three solutions (A, 4, v) of Eq. (2.9) are real, labeled with the 
; <v< —a; << —a; < X. For assigned (A, jz, v), the substi- 
tutions T = v, T = yu, and t = A in Eq. (2.9) produce three families of confocal quadrics 
mutually intersecting at right angles at the position x, whose ellipsoidal coordinates are 
in fact (A,j,v): the student is encouraged to visualize the geometry of the coordinate 
surfaces, and possibly to work out the axisymmetric cases, leading to oblate and prolate 
ellipsoidal coordinates. In particular, 7 = A > —a} corresponds to the family of triaxial 
confocal ellipsoids, and for very large values of A the ellipsoidal surface becomes more and 
more spherical, with a radius asymptotically approximated by \/A, so that the pair (jz, v) can 
be to some extent interpreted as the pair of “angles” in spherical coordinates, determining 
the position of the point on the surface of the ellipsoid A = const. In Appendix A.8 
(where we also derive the explicit formulae for all of the relevant differential operators 
expressed in ellipsoidal coordinates to be used in the following discussion), we derive the 
explicit transformation formulae between (A, u,v) and (x,y,z): from Eq. (A.146), note 
how the physical units of the ellipsoidal coordinates are length squared, how their values 
depend on the semiaxes (a, a2,a3) of the unitary ellipsoid S(x) = 1, and finally how they 
are degenerate (i.e., a given triplet (A, 1, v) actually corresponds to eight points in 9%? with 


conventional ordering —a 


all of the combinations of +x, ty, and +z). 

Having introduced the ellipsoidal coordinates, we are now in a position to prove 
Newton’s third theorem. We first restrict ourselves to the surface S(x) = 1, and we solve 
the Laplace equation Ad = 0 in the two distinct regions S(x) < | and S(x) > 1. In 
particular, we require that in the external region (A > 0) the potential can be written 
as @ = (A) (i.e., the isopotential surfaces are ellipsoids that are concentric, coaxial, and 
confocal with the surface S(x) = 1). Therefore, once #(A) is found, # (0) is the value of the 
potential on the unitary surface, and from the unicity theorem this is also the constant value 
of the potential for points with S(x) < | (Le., for —az <2 < 0). After the determination of 
(A), we will search for the infinitesimally thin surface density distribution o; coincident 
with the unitary ellipsoid S(x) = 1 and producing such a potential: the superposition 
of an infinite number of such isopotential surface density distributions will finally prove 
Newton’s third theorem. 

From Eqs. (A.147) and (A.157), the Laplacian in ellipsoidal coordinates can be 
written as 


4ASA(X 0 0 4./—A 0 0 
Ag ia | vaa-e | + vy AM) E aw se | 


~ @=MG—v) aH O— WU —v) on 


av) _@ | vaore" (2.10) 


(A — v)(u — v) dv av 
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where? 
A(t) = (a? + t)(a2 + 2)(a2 +7). (2.11) 


From the assumptions ¢ = @(A) for A > 0 and @(A) > 0 for A + o, the solution of the 
resulting Laplace equation for S(x) > 1 is immediately obtained as 


CO 
=, > oaa=-caf 
dh JAA) rox) VAT) 

where C| is an arbitrary constant with the dimensions of a mass (prove it!) and A(x) > 0 
is the positive solution of Eq. (2.9) for generic x outside the unitary ellipsoid. The potential 
(0) in the region S(x) < | is simply given by Eq. (2.12) with A = 0. As is shown in 
Eq. (2.67), in the triaxial case @(A) is expressed in terms of elliptic integrals, while in the 
axisymmetric case the integral involves standard trigonometric functions. 

Having determined the function @(A), we now face the problem of the existence of a 
surface mass density o;, geometrically coincident with the surface S(x) = 1, which pro- 


(2.12) 


duces such a potential; by construction, o; will then coincide with an isopotential surface. 
We apply the Gauss theorem to each element of the surface S(x) = 1 as follows: by 
construction, A = 0 labels the isopotential surface S(x) = 1, and so the outer gradient of ¢ 
is perpendicular to the surface itself, and from Eq. (A.149) 


f, do 
Vol|,-9+ = —— : (2.13) 
$la=0 meer 
Moreover, inside the surface, the potential is constant with V@|,—9- = 0, and from the 


Gauss theorem (1.7) applied to an infinitesimal cylinder with a lateral surface parallel to fy 
and crossing the surface, 
do — a 

4nGh; da|,-o+  20./jv’ 
where the final beautiful identity derives from Eqs. (2.12) and (A.147). Notice that the 
obtained density distribution is positive everywhere, so it can be intepreted as a mass 
density. Of course, Eq. (2.14) can now be used the other way around: for a given ellipsoidal 
surface density of the family oj = A1/,/j4v, we must then fix C; = 277 A, in Eq. (2.12). 

We now have to find the physical meaning 0; (u,v). This is obtained from the co-area 
theorem (Appendix A.9). In fact, from Eq. (A.162), we can rewrite the potential (2.4) of 
the heterogeneous ellipsoid (for the moment in the case of finite total mass) by breaking it 
into a sum over infinitesimally thin density layers of nonconstant surface density: 


p(m)d>y __pim)d?y " 
$x) =—G / ee [va mf =| dntivam, 
ow (Ix— yll 0 S(y)=m IVSIIx—yl Jo 


(2.15) 


o1(H,v) = (2.14) 


3 Note that from the ordering of A, jz, and v, it follows that A(A) => 0, A(z) < 0, and A(v) = 0; see also Eq. (A.147) and the 
associated footnote. 
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where d’y indicates the surface area element of the ellipsoid S(y) = m and @y indicates 
the potential produced by the associated co-area surface density o, = p(m)/||VS||s=m. 
The fact that o,, is not constant over the surface S = m, even though p(m) is constant, 
is explained by the geometric interpretation of the co-area theorem (see Figure A.2). In 
Exercise 2.4, we prove the amazing result that for the m = 1 ellipsoidal surface associated 
with the density distribution p(m), 


1 1 
pS ee. Gaara eal, (2.16) 


IVS |ls=1 Juv 

in other words, that the co-area surface density o;, once expressed in ellipsoidal coordi- 
nates, has exactly the functional dependence of the isopotential oj (u,v) in Eq. (2.14), so 
that Eq. (2.12) also provides the potential generated by o; once C is fixed as in Eq. (2.16). 

In order to integrate Eq. (2.15), we now resort to the superposition principle, combined 
with the result just established. In practice, for given x we add all of the contributions 
$m (xX) produced by o,, of each stratum S = m. Some care is needed, because it should be 
obvious that each stratum S = m is characterized by its own ellipsoidal coordinates, which 
we indicate generically with t,,. Notice that the ellipsoid S = m corresponds to a unitary 
ellipsoid of semiaxes ma;, so that the equivalent of Eq. (2.9) becomes 


3 x? 


ae =1; (2.17) 


i= 2a? ne 


the three coordinates‘ of x are labeled as (Am; Lm; Ym), and the transformation formulae are 
immediately obtained from Eqs. (A.146) and (A.147) with the substitutions a; + ma; and 
T +> T,. By repeating the treatment used for the unitary ellipsoid m = 1, one obtains the 
generalization of Eqs. (2.12), (2.14), and (2.16), where (prove it!) 


Cin aara3p(m)m 2 
On = = => Cm = 20 a\a2030(m)m~, (2.18) 
i 20 Vm Yn VJ LmYn = 


so that finally 


[ee 


dm (X) = —27 Gayaya3p(m)m? / dim (2.19) 


Ain (x) V Am (Tm) : 


where Aj (Tj) is obtained from Eq. (2.11) with the substitution a; + maj and t + Tp, 
and finally 2, (x) is the positive solution of Eq. (2.17) if the point x is outside the surface 
S = m, and zero otherwise. Therefore, by combining Eqs. (2.15) and (2.19), we have a 
quite formidable expression for the potential of a triaxial ellipsoid, and we can proceed to 
integrate @» over m in Eq. (2.15). 

First, for the given position x, we compute the critical value me = S(x). The integral 
over m then splits into two integrals, the first over 0 < m < me and the second over 


4 In the nomenclature of Eq. (2.17), the parameter t in Eq. (2.9) would be indicated as t = tT. 
In Exercise 2.5, we prove by direct integration that the surface integral of o,, over the ellipsoid A, = 0 evaluates to the 
identity 2Cm = Mm, where Mm is the mass of the shell. We will return to this point when discussing multipole expansion. 
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Me < m < &. For the latter integral, x is inside the ellipsoidal shells, and the value of 
Am(X) in Eq. (2.19) is zero. In the first integral, we proceed as follows: we recognize that 
Eq. (2.17) can be rewritten as 


Tn 
m 2? 


5) =m’, T= 
Pas 


(2.20) 


so that, if A(m,x) is the positive solution for t in Eq. (2.20), then A(m,x) = A »(x)/ m?. 
We are almost done: we now change the variable in the inner integrals to t,, = m?t, so 
that Am (tm) = m>./A(), then we invert the order of integration between m and t from 
Eq. (2.20) due to the fact that in the integral over 0 < m < m¢- we have A(m¢-,x) = 0 and 
A(0,x) = ov, and finally we combine all of the resulting expressions to obtain @(x) (and 
the proof of Newton’s third theorem!). It is quite astonishing that the illustrated procedure 
is finally summarized the following remarkable formula (e.g., see Binney and Tremaine 
2008; Chandrasekhar 1969; Kellogg 1953): 


$(x) — (xo) = —G i ey (2.21) 
A) = Xo) = —G7aj,a2a3 T, : 
0 VAC) 
where 
m(Xo0,T) 3 2 
AW(t) = 2 f p(t)tdt, m?(x,t)= io —, (2.22) 
m(X,T) i=l a; +T 


and where we introduced (by subtraction) the potential at the reference point xo in order 
to also allow for the treatment of infinite-mass ellipsoids. Due to the highly nontrivial 
proof of Eq. (2.21), it should be reassuring for the student to be able to show that #(x) 
does in fact obey the Poisson equation (2.6), and Exercises 2.6 and 2.7 are devoted to this 
important aspect. Moreover, in some astronomical application, it is convenient to consider 
the potential and the gravitational field outside a truncated ellipsoidal distribution, and in 
Exercise 2.8 we derive a specialized version of Eq. (2.21) tailored to these cases. 

Two comments are in order. The first is that, with (x) being given by the sum of 
the potentials 6, produced by each isopotential surface density oj, and with each ¢», 
being confocal with o,,, it should not be a surprise that the isopotential surfaces produced 
by a generic ellipsoidal distribution are not in general ellipsoidal (see Exercise 2.9 and 
Figure 2.3). Second, a notable situation is represented by the special family of constant- 
density ellipsoids in Eq. (2.74) when the resulting potential inside the ellipsoid is a 
quadratic function of the Cartesian coordinates, because from Eq. (2.75) 


AW(t) = p(0) x [m? — m*(x,7)]. (2.23) 


As is shown in Egs. (2.67), (3.11), (3.12), (3.62), and (3.63), the coefficients of the quadric 
are in general elliptic integrals in the ratios g; = a3/a, and gy = az/a,, while in the 
axisymmetric cases, they reduce to inverse circular functions. Physically, inside a constant- 
density ellipsoid, orbits are then harmonic oscillators (in general with different frequencies 
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Figure 2.3 Qualitative illustration of the relative shapes of two ellipsoidal surfaces S = mg and 
S = mp (with foci at Fg and Fy) and the two associated isopotential surfaces @g (dotted line) and 
op (dot-dashed line), contributing to the potential at the position x. Note how the isopotential ¢q (x) 
is rounder than ¢,(x), consistent with the confocal shapes of @, in Eq. (2.19). 


along the three Cartesian axes). In Chapter 13, we will consider another special family of 
ellipsoidal galaxy models, the so-called Ferrers ellipsoids, whose potential can be written 
explicitely from Eq. (2.21). We will also briefly describe the important family of galaxy 
models obtained when starting from a gravitational potential written by using ellipsoidal 
coordinates, and among them the very special class of Stdckel models. 


2.2.1 Homeoidal Expansion 


It should be fairly evident that the potential generated by a generic ellipsoidal density distri- 
bution usually cannot be expressed in explicit, closed form. However, and not unexpectedly, 
the limit for small flattenings of Eq. (2.21) is remarkably simple, and the obtained formulae 
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are flexible enough to allow the construction of galaxy models that depart from spherical 
symmetry and that can be used as guidelines for more detailed studies (e.g., in the Jeans 
modeling discussed in Chapter 13, in orbit computations in ellipsoidal stellar systems, or 
in the construction of hydrostatic equilibria for the hot gaseous atmospheres of triaxial 
galaxies and clusters of galaxies). In the following, we will refer to this framework as the 
homeoidal expansion technique (e.g., see Ciotti and Bertin 2005; Ciotti and Pellegrini 2008; 
Ciotti et al. 2006; Lee and Suto 2003; Muccione and Ciotti 2004; Schwarzschild 1979). 

In order to illustrate the method, we consider a spherically symmetric density distribu- 
tion of finite total mass M, which we write as 

7 


P(r) = pop(s), s=—, (2.24) 
a} 


where /o is a density normalization constant, O(s) is the dimensionless density profile, and 
a, is a length scale. Without loss of generality, we fix the density scale pp so that 


M [oe] 
po = > [ p(s)s*ds = 1. (2.25) 
1 


We now transform the spherical density (2.24) into an ellipsoidal density of the same total 
mass with the substitution 


(2.26) 


where the new density scale py will be determined later. Without loss of generality, we 
assume a, > a2 > a3, and we define the two ellipticities 


a2 a3 
—=q=1-e., —=q,=1-n, O<e<n<l: (2.27) 
ay . 1 


in the spherical limit, gy = gq; = 0 and € = n = O. It is an elementary exercise (do it!) 
to show that in order to conserve the total mass M of the new ellipsoidal distribution, 
independently of the values of € and 7, we must have® 


oe: = ium (2.28) 
(d—ed—m ”~ arae 

We now perform the expansion for small values of € and 7 of the density and of the 
associated potential: as the Poisson equation is linear in the pair (9,@) and powers in 
€ and 7 are linearly independent, it follows that by performimg expansions at any order 
and selecting the functional coefficients of given orders in € and n, the obtained functions 


P(X) = pPnp(M), Pn = 


6 Notice that M, instead of being the otal mass, could be the mass contained in a sphere of some prescribed radius sg and inside 
the ellipsoid m = so: in this case, the normalization integral (2.25) extends up to sg. Moreover, the assumptions of a finite total 
mass M for the density profile in Eq. (2.24) and the independence of M from the density flattening can be easily relaxed. The 
student is encouraged to work out the formulae for models of infinite total mass and/or when the total mass is not fixed. A 
simple way to proceed is to consider py in the first part of Eq. (2.28) as a fixed-density scale independent of flattening, so that 
Wn = 4 Gpnaz (1 — €)(1 = n) in Eq. (2.30). 
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describe exact, nonspherical density—potential pairs obeying the Poisson equation. For 
example, at the first order in the flattenings, 
p'(S) y Zz 


1 _pagcess@e eer’... a2, 52, C29) 
(6-7) 5 ai ai 


Of course, the density at the right-hand side of Eq. (2.29) is physically acceptable only if 
it is positive, and in Exercise 2.11 we derive a general inequality to be satisfied by € and n 
in order to ensure the positivity of the expansion. Furthermore, in Exercise 2.12 we prove 
that the potential in Eq. (2.21), produced by the density in Eq. (2.28), can be written as 


e GM 
$(x) = —Vnd(xX) Yn= ar (2.30) 
and that fore > 0 and n > 0 
b(X) ~ In(s) + (€ +) h(s) + (63? + 02) (s), (2.31) 


where 


Ip(s) = -[ p(t)t7dt + i. p(t)t dt, 
0 S 


h@= af pit)ttdt + =| p(t)t dt, (2.32) 
3s 0 3 s 


1 Ss 7 4 
In(s) = -= | p(t)t dt. 
S~ JO 


A couple of explicit examples based on galaxy models often used in theoretical and obser- 
vational works are discussed in Exercises 2.13 and 2.14, while further considerations are 
deferred to Chapter 13. 

Notice that the homeoidal expansion can be used not only to calculate with good approx- 
imation the potential of slightly ellipsoidal systems, but also to generate genuine non- 
spherical density—potential pairs when considering the truncated expansions for finite (not 
necessarily small, provided positivity is guaranteed) values of ellipticities. For example, 
it is an instructive exercise to show that, by combining the ordering argument and the 
superposition principle, the following is a density—potential pair for arbitrary values of 
the constants a, 6, and y: 

Pd 
px) = (a8? + py + 2) 2™, 
s (2.33) 
G(x) = (@ + B+y)LU(s) — Lo(s)] + (ak? + BH + 2) DOs). 


From this point of view, the presented technique can be seen as a particular case of a 
general approach based on the linearity of the Laplace operator, which can be exploited to 
generate families of density—potential pairs by the introduction of suitable parameters in a 
known density—potential pair and then performing differentiation (or integration) of both 
members of the Poisson equation with respect to the parameter. As a few examples, we 
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may recall models presented in Miyamoto and Nagai (1975), Nagai and Miyamoto (1976), 
Satoh (1980), and Vogt and Letelier (2009a), obtained starting from the Kuzmin (1956) 
disks (see also Binney and Tremaine 2008; Toomre 1963), the models of Williams and 
Evans (2017) (see also Long and Murali 1992), and the model of Letelier (2007), or finally 
the complexified models discussed in Chapter 13. 


2.3, Multipole Expansion 


One lesson from the previous section is that we cannot hope to obtain closed-from expres- 
sions for the potential of a generic density distribution (except in very special cases), and so 
more powerful and flexible methods must be developed to tackle the problem when dealing 
with the modeling of stellar systems. A first significant step in this direction is to consider 
the gravitational field at large distances from an object of finite size and total mass M but 
otherwise of arbitrary shape: at the present stage, the only requirement (to be relaxed) is 
that the density p(y) is zero outside some sufficiently large spherical volume enclosing the 
body (see Figure 2.4). In fact, in this case we can obtain the expansion of the potential for 
points outside the enclosing sphere as an absolutely (and so uniformly) convergent series. 


-. -- 


Figure 2.4 The geometry of the multipole expansion for the potential #(x) produced by a density 
distribution o(y) vanishing outside a sphere of radius a. 
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This is obtained as follows: we consider Eq. (2.4) with #(xo) = 0 and ||xo|| — oo, we 
expand the function ||x — y|| = V IIx? + |lyl]2 — 2(x,y) for ||x|| > @ in ascendig powers 
of t = |lyll/||xl| < 1 by using Newton’s binomial theorem in Eq. (A.50), and finally we 
perform termwise volume integration of the resulting series. The obtained expansion can 


be used to obtain the far-field expansion of the gravitational field of a generic body, so it 
is particularly useful in the investigation (for example) of the motion of distant globular 
clusters around a galaxy, or of a satellite around a planet, and so on. In practice, the general 
far-field expansion of the gravitational potential produced by p(y) is given by 


$=¢ot+bit+g2+---, (2.34) 


where it is an elementary execise of algebra to show (do it!) that the terms up to quadratic 


order are 
GM (x, Rom) (Ox, x) 
do=-——, $1 =-GM—_—2—, hh=- ; (2.35) 
I|x\| IIx| 2\)x\I° 
where 
1 
Qi; = u / yi yj — llyI?S;)e(yd?y, (2.36) 
x3 


which is the quadrupole tensor; higher-order terms become algebraically more and more 
complicated. Note that ¢o (the monopole) is independent of the position, shape, and ori- 
entation of the body, ¢; (the dipole) depends on the position Rcm of the center of mass 
(and so can always be set to zero by fixing the origin in Rcm), and ¢2 (the quadrupole) 
depends on the shape and orientation; more and more details about the structure of p(y) 
are captured by higher-order multipoles. Note also that Q;; is a second-order symmetric 
tensor, with TrQ;; = 0, so that in general only five integrals must be computed. However, 
as Q;; can always be expressed in diagonal form with a suitable choice of the orientation of 
the reference system, actually only two integrals must be computed in the reference system 
aligned with the eingenvectors, reducing to just one integral in the case of axisymmetric 
systems. Finally, for a spherical system centered at the origin, the previous considerations 
show that Qi; = 0. Therefore, at large distances from a body with its center of mass in the 
origin, @o is already a quite good approximation to the true field, as the first nonzero term 
is the quadrupole (or higher-order multipole) term. Of course, Newton’s second theorem 
proves immediately that outside a spherical body with Rcy in the origin, all of the multi- 
pole terms, except the monopole, vanish identically. A peculiar property (that we will not 
prove) of multipole terms, which is particularly relevant in electrostatics, is that if the first n 
terms in the expansion (2.34) are zero, then the value of the n + 1-th term is independent of 
the position of the origin (e.g., see Jackson 1998). Of course, in gravity this property is not 
applied, as ¢o9 4 0 for mass distributions.’ We conclude this discussion with a comment 
about the potential of ellipsoids discussed in the previous section. As proved in Exercise 2.5 


7 In electrostatics, when the total charge of the system is zero, the multipoles cannot be defined “per unit charge” as we do for 
gravity by explicitly factoring out the total mass M, as, for example, in Eq. (2.35). 
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with a direct and nontrivial integration, for the ellipsoidal stratum in Eq. (2.18), the constant 
Cm evaluates to M,,/2, half of the stratum’s mass. Now, from multipole expansion we 
know that at large distance r from the stratum, 69 ~ —GM,,/r: at the same time, we can 
integrate Eq. (2.19) by noting that for large r, Am(x) ~ r and so Ain (Tn) ~ 1,/*, anda 
trivial integration proves again, in a much simpler way, that 2C;, = My. 


2.4 The Green Function for the Poisson Equation 


The idea of multipole expansion can be seen as a first (and extremely useful) attempt to 
elaborate a general and systematic strategy to find the solution to the Poisson equation (2.6) 
for arbitrary mass distributions. In fact this is possible, and it is based on the construction 
of the so-called Green function for the Laplacian: the logic behind the method is described 
in Appendix A.7. Of course, the literature on Green functions is immense, as the theory 
applies to generic linear differential operators, with applications in physics ranging from 
gravitation, to electromagnetism, to radiative transport, to quantum mechanics and electro- 
dynamics, and so on. According to Eq. (A.133), it is apparent that the Green function for 
the Laplacian is given by the solution of the equation 


AG = 5(x — &). (2.37) 
The student may wonder why we should invest time in figuring out a method to solve Eq. 
(2.37): after all, from Eq. (2.7) we already know a solution: 


G(x,&) = — (2.38) 


An ||x — & ||’ 
and in fact its substitution in Eq. (A.134) reproduces (as expected) Eq. (2.4). Are we just 
going around in circles? Why bother with additional mathematics if we reobtain a known 
result? There are two main reasons for this. The first is that in the present case of the 
Laplacian operator, we are in the fortunate and quite exceptional situation of knowing G in 
advance, but in general the constructive approach is the only way to find such a solution. 
The second reason is that once we know how to construct G for a given operator, we 
can do this in different coordinate systems, and so obtain different representations for the 
Green function. In our case, we will obtain different representations for the function in the 
right-hand side of Eq. (2.38) and, even more importantly, in terms of special (orthogonal) 
functions, with a sort of “factorized” dependency on the coordinates x and &, instead of 
having them “mixed” in a nonlinear way as in Eq. (2.38). 


2.4.1 Cartesian Coordinates 
We begin with the case of Cartesian coordinates (x, y,z), when from Eq. (A.119) 
em a 


A= 
ax2 ay 


(2.39) 
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Cartesian coordinates are perhaps less interesting from the point of view of stellar dynam- 
ics; however, their simplicity allows us to illustrate a few important points of the procedure 
without additional mathematical difficulties. Moreover, a few situations of astrophysical 
interest can be imagined, such as the computation of the potential of inhomogeneous mass 
distributions as in thin disk galaxies. As is pointed out in Appendix A.7, the first step is the 
search for solutions Go of the homogeneous problem, which we then must combine in some 
clever way to obtain a solution for Eq. (2.37). A standard approach that works in several 
coordinate systems is to use the method of the separation of variables, and we then assume 
that Go = U(x) V(y)W(z), with the three factors to be determined (e.g., see Jackson 1998). 
Quite obviously, as we are searching for a solution in factorized form, there is ample room 
for different choices of the constants of separation, and in general the choices are suggested 
by the nature of the problem and by experience. In the present case, we separate first the 
x and y variables by computing AGo/Go = 0, and we call the two constants —k2 and 
—;2: 


3; Moreover, we indicate with k = (k,,ky) a two-dimensional vector. In this way, we 


obtain for the functions U and V (stationary) plane waves,® while the remaining function 
W is given by a combination of real exponentials (do it!). As the Laplacian is a linear 
operator, we are now free to sum an arbitrary number of such solutions, with arbitrary 
weights (in general dependent on k), and a general solution for the homogeneous problem 


can be written as 
1 P a eee ee 
Go(x) = =| [Sz anyel eri + G (k)e!(&¥) «] d’k, (2.40) 
ne 


where k = ||k|| and y = (x,y). We now take advantage of the special nature of plane 
waves, the building blocks of Fourier transforms (see Appendix A.3), to attack the Green 
function. In fact, we now write 


g=—f Cane ak, (2.41) 

2m Jx2 
and we determine G (k, z) as follows: we insert Eq. (2.41) into Eq. (2.37) and we com- 
pute the Laplacian, we Fourier transform both members of the resulting identity, and we 
obtain a one-dimensional Green problem that can be treated with the method described in 
Appendix A.7 (see Exercise 2.21). Finally, by inserting the result into Eq. (2.41), we obtain 


1 1 elky—m—kle-Bl 
Ix—€l| 2x [, k dk, n= (1,2), k= (KI. (2.42) 


2.4.2 Spherical Coordinates 


Thanks to the experience gained in the previous section, we now consider the (slightly) 
more complicated case of spherical coordinates (7,7, @g), when from Eq. (A.157) 


8 Stationary plane waves are therefore eigenfunctions for the Laplacian, with eigenvalue - = IKI. 
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a= ssa (5) + saan ae (Sap) + garg apt = ort SE oe 
where the meanings of the radial (A;) and the angular (Ag) components of the Laplacian 
are apparent. We search again for a solution for the empty space in the family Go = 
U(r) V(O)W(@) (1.e., we solve AGo/Go = 0). We separate first the azimuthal component 
and we require, for obvious reasons, that the resulting W function is periodic (i.e., we 
require that after a rotation of 27 around the z-axis the solution is unaffected”), obtaining 
(do it!) 


ele 


J2n° 


or in other words, rotational invariance of space requires that the separation constant is 
—m?, with m being an integer number, the classical azimuthal quantum number! The 
obtained functions are an orthonormal, complete set (i.e., they can be used to represent 
functions (provided convergence is assured) over the interval 0 < g < 27 as Fourier 
series; see Appendix A.3). We then move to separate the resulting equation for the function 
V, and the student is invited to prove that we obtain the Legendre differential equation 
(A.71): as discussed in Appendix A.2.3, the only acceptable solutions for our problem are 
given by the Legendre associate functions of the first kind P/’(cos #), with? = 0,1,2,..., 
and —1 < m < J, the classical second quantum number. This because these are the 
only Legendre functions that are regular at ® = 0 and } = z; as in our problem there 
are no reasons for a singularity along the arbitrary choice of the z direction (again, this 
is a condition to be relaxed in several problems of electromagnetism, such as a conic 
conductor), this forces a quantization of the V; functions. Remarkably, the P/” functions 
are also an orthogonal, complete set at fixed m for 1 > |m|, and again they allow us to 
expand in Legendre series functions of 1. With these two families of functions, and after 
normalization to the norm of P/", we can construct one of the most important objects of 
physics and mathematics: the spherical harmonics 


Wm (y) = 


m € Z, (2.44) 


21+1(—m)! . 
Y"" (0,0) = .| ——— ———_P" (cos # )e’”””, 2.45 
1 (3,9) ie Caml | ) (2.45) 
and it is now a simple exercise to show that integration over the whole solid angle (do it, 
first integrating over ~) 


/ YOO, OPC 2 =sping Y= Cp"y,”, (2.46) 
An 


where d*Q = sin 9*didq (i.e., the spherical harmonics are a complete system of orthonor- 
mal functions on the surface of the sphere). Therefore, for a given function f(%,@g) with 
0 < » < 2m andO < } < z, wecan write 


9 In several problems of electromagnetism, this requirement is not necessary, such as in a conductor with the shape of an orange 
segment. 
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f@9) = 3 ye ws ine [ fO.@)Y]*(@.p)d’2, (2.47) 


1=0 m=-—I 
the so-called Fourier-Legendre expansion!® of f. The student should also prove that 
AgYy = -I@+ DY?, (2.48) 


a fundamental identity in the discussion of angular momentum in quantum mechanics (e.g., 
see Messiah 1967). In Exercise 2.24 we solve the radial equation for U(r) produced by 
separation of variables, and from Eq. (2.99) we finally obtain the general solution for the 
homogeneous problem as an arbitrary linear combination of the particular solutions 


ee i 
Go(x) = > [se m)r' + So lyre (2.49) 


1=0 m=—I 


we will use this identity in Exercise 2.25. 

We now proceed along the same lines of the previous section, and thanks to spherical 
harmonics we represent the still unknown Green function as a Fourier—Legendre series with 
generic weight functions depending on the radius: 


love) I 
G= D> Do Gim(r)YP (0.9). (2.50) 

1=0 m=-—l 
In order to determine G, we now require that Eq. (2.37) is obeyed: as we are using 
spherical coordinates, in the following we indicate with (r,v,@) the coordinates of x, and 
with (,0’,g’) the coordinates of . Therefore, we compute (do it!) the Laplacian of G in 
Eq. (2.50) by using Eq. (2.48), we transform the three-dimensional 5-function in spherical 
coordinates from Eq. (A.97), and we Fourier—Legendre transform the obtained identity 
by multiplication for a generic x5 and integration over the solid angle. The procedure 
leads to a one-dimensional Green problem for the function Gi (r), already solved in 
Exercise 2.24, and from Eq. (2.101) we finally prove that in spherical coordinates 


1 
yn (?, p)Yy* (8, ¢’) rl 
= 4 : a (2.51) 
BH mp2 +1 


where r< = min(r,é) and rs = max(r,é), and (prove it!) from Eqs. (2.45) and (A.74) 
Y7"*(9,9) = (-)"Y,"(0,¢). (2.52) 


Thus, for a generic density distribution expressed in spherical coordinates p = p(&,0',¢’), 
if we define 


Pim (&) = [ peo’. g')Y7"* (8',9)d? 2", (2.53) 


10 In practice, the double series in Eq. (2.47) is obtained by first Fourier expanding f(y, 0) over y, then Legendre expanding 


fm(#) over 9 by using P/” at fixed m, and finally recognizing that (prove it!) 7-50 > 1>|m| = 2 ro > dys 
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ey 0, rl 
jiesne yy. a ; erin dé. (2.54) 
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Before concluding this section, it is worth pointing out two important consequences of 
what we have done. The first is that in Eq. (2.51) the direction of the z-axis is fully arbitrary, 
and of course the identity also holds when the z-axis is aligned along &: it is a nice exercise 
(do it!) to show that in this special case the spherical harmonics in Eq. (2.51) reduce to 
Legendre polynomials, i.e., 


oo oe 
=~ Pi(cos #) 75 aT" (2.55) 
1=0 


1 
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We obtained a closed-form expression for the (messy) case of the computation of higher- 
order terms of multipole expansion in Cartesian coordinates (consider r > &) and the proof 
of the generating function of Legendre polynomials in Eq. (A.83). Notice that a comparison 
of Eq. (2.51) with Eqs. (2.34) and (2.35) shows that the multipole term of order n contains in 
general (at most) 2n + | different quantities, given by the spherical harmonics at fixed / = n 
(e.g., compare with the generic monopole, dipole, and quadrupole terms); for example, 
the octupole potential (which we did not compute) contains a third-order tensor of 27 
components, but the previous argument shows that actually at most only seven genuinely 
different quantities are involved! Finally, following Jackson (1998), we obtain the very 
important spherical harmonics addition formula by starting from Eq. (2.51), rotating the 
coordinates in the left-hand side to place & along the new z-axis (and this of course leaves 
the value of the left-hand side unchanged), and then comparing with Eq. (2.55): as powers 
are linearly independent, we have 


l 


> Y7' (8, p) Y7"* (0,9) = Pi(cos y), (2.56) 
m=—I 


4a 
21+ 1 


where y is the angle between x and & (i.e., from the inner product) and cosy = 
cos 0 cos }” + sind sin? cos(y — 9’). 


2.4.3 Cylindrical Coordinates 


We can finally solve the problem of the construction of the Green function for the Laplacian 
in cylindrical coordinates (R,g,z), one of the most used coordinate systems in stellar 
dynamics. From Eq. (A.157) we have 


1 a a 1 a a? 
A= R 2.57 
aan ( a)tpimt een 


We search again for a solution for the empty space in the family Go = U(R)V(z)W(@) (ie., 
we solve AGo/Go = 0 by using the separation of variables). As for spherical coordinates, 
we first separate for the azimuthal angle g and, repeating identical arguments, we again 
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obtain for the constant of separation —m? for the family of the complete set of orthonormal 
Fourier W,, functions given in Eq. (2.44). Next, we separate the resulting equation for the 
z-coordinate, and we use as a separation constant the quantity k*, with k being a positive 
real constant; this is done in order to obtain V functions that can present an exponential 
decline!! for |z] > 00. We can now move to studying the last differential equation for the 
radial function U(R): the student is invited to prove that we obtained the Bessel equation 
(A.84). As discussed in Appendix A.2.4, the second-order, linear differential equation 
admits two independent solutions, and Fuch’s theorem shows that the only regular solutions 
at the origin are the Bessel function of the first kind J,, with m integer. These functions are 
orthogonal with weight R over the range 0 < R < oo and complete, and they allow us 
to represent radial functions as Hankel transforms, as shown in Eqs. (A.88) and (A.89). 
We now combine an arbitrary number of the obtained solutions, and we recover a general 
expression (do it!) for the homogeneous solution as 


oo img 
m=—oo \ 2m 


We now proceed along the same lines as the previous section, and we represent the 
still unknown Green function in the Fourier—Bessel form, with generic weight functions 
depending on z, 1.e., 


Go(x) = 


[Gi Gombe + Go (ksme™ [In(KR) dk. (2.58) 


oo eine 
j= >= =| On(k,2)Im(KR)K dk. (2.59) 
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In order to determine G, we now require that Eq. (2.37) is obeyed: as we are using 
spherical coordinates, in the following we indicate with (R,y, z) the coordinates of x, and 
with (&,z’,g’) the coordinates of &. Therefore, we compute (do it!) the Laplacian of G in 
Eq. (2.59), we transform the three-dimensional 6-function in cylindrical coordinates from 
Eq. (A.97), and we Fourier—Hankel transform the obtained identity by multiplication for 
a generic e'”?/./27 element and integrating over y, followed by multiplication for a 
generic J, (/R) element and performing a final integration over RdR. The procedure leads 
to a one-dimensional Green problem for the function Gn (k,z), which is already solved in 
Exercise 2.21, so that in cylindrical coordinates we finally have 


eim(g— ay eT Fe—21) (KR) Jin (KE) dk. (2.60) 


ae nwo 
It follows that for a generic density distribution expressed in cylindrical coordinates 
p = p.¢',z’), 


ev img’ 


TE 


inthe) = fo Imitsyeds f"* pE,g',z') dy’, (2.61) 


Wl Again, this choice is motivated by the fact that in astronomical systems the potential is needed over the whole space: in 
electromagnetism, often the value of the potential is assigned on planes at some finite distance from the equatorial plane, and 
other choices for the separation constant are more appropriate (e.g., see Jackson 1998). 
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the potential can be written as 


and eine ioe) ioe) te-2' 
o(x) = —2xG Im (kR)dk / eHz-21 5 (kz dz’. (2.62) 
p2) am I m = m 


The identities obtained are of especially important in Stellar Dynamics for the mathematical 
treatment of disks and rings, and in the exercices some of the most famous cases of razor- 
thin disks are presented. The student is reminded that other methods exist to recover the 
gravitational potential of disks, and some of these additional techniques are discussed in 
more advanced literature (e.g., see Binney and Tremaine 2008; Evans and de Zeeuw 1992; 
Freeman 1966; Hunter 1963; Kalnajs 1976a,b; Mestel 1963; Toomre 1963 and references 
therein; see also references in the exercises in Chapter 5). 

A final comment is in order: the student will notice in the exercises a quite significant use 
of tables of integrals, and they could ask why, with the availability of powerful computer 
algebra systems, one should spend time looking through deeply boring tables. Experience 
shows that looking through tables greatly helps us to understand the properties of our 
integrals, and that while computer algebra is an invaluable help for research and numerical 
experiments, it should not (like tables) be trusted blindly when searching for the closed 
forms of difficult integrals, especially when they depend on parameters. For these reasons, 
the student is invited to enjoy the experience of repeating at least a few of the suggested 
exercises. 


Exercises 


2.1. Consider the so-called singular isothermal sphere density profile 


6 a (2.63) 
ON?) te Gr? eet , 


where vu, is the constant circular velocity of the model (prove it!). By using Eq. (2.5), 
show that the potential can be written as 


o(r) = v2In—, (2.64) 
rO 


where ro is an arbitrary scale length. 
2.2 Consider the gravitational field produced by the razor-thin, planar mass distribution 


p(x) = o(y)d(z), (2.65) 


where y = (x,y) and o(y) is the surface density of the stratum. By using Gauss 
theorem, show that the jump of the z-component of the field across the surface is 


gt — g, =—4nGo(y). (2.66) 


Repeat the convergence argument in Eq. (2.4) and find the asymptotic behavior of « 
as a function R = ||y|| so that ||xo|| — co can be assumed. Discuss the result along 
with that of Exercise 1.8. 
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2.3. Prove that the integral appearing in Eq. (2.12) when calculating the potential pro- 


2.4 


2.5 


duced by the unitary isopotential ellipsoidal shell S(x) = 1 can be written as 


ee 
arctan =i 
qe +h 


co dt 2 
wir@@l= f | = ee 
a(x) VA(T) ai,/1—q? | arctanh = a 


F(@,k), 


where qy = a2/d), qz = a3/a}, 


2 1—q2 1 — q2 
j= *® ee 2 ge] (2.68) 
ay 1+2 1-@q: 


and F(g,k) is the elliptic integral of the first kind in Eq. (A.62). The three expressions 
in Eq. (2.67) hold respectively for the oblate case around z (qy = 1), the prolate case 
around x (gy = qz), and the triaxial case with d = 0 for points inside and on the shell 
(i.e., for S(x) < 1). Evaluate the spherical limit of the expressions above and show 
that Newton’s first and second theorems are recovered. Hints: The integration of the 
axisymmetric cases is elementary, while for the triaxial case, use eq. (238.00) in Byrd 
and Friedman (1971) or eq. (3.131.8) in Gradshteyn et al. (2007). For the evaluation 
of the spherical limit, use Eq. (2.9). 


Prove that the surface density associated via the co-area theorem with the unitary 
ellipsoid for the three-dimensional density p(m), once expressed in ellipsoidal coor- 
dinates, is given by Eq. (2.16). Hints: First show that in Cartesian coordinates 


me n y2 22 
a 4? 


IVS|]sa1 = Z 
1 ®& 4% 


(2.69) 


and then use Eq. (A.146) with A = 0. A more elegant proof can be obtained by 
differentiation of the two sides of Eq. (A.145) with respect to t 


x2 y? 2 
(aj +1)? r (aj+t)?  (aj+r)? 
_ (t—4u)(t— v) d (t—w)(t—v) 
= KG +(e -N ro iene (2.70) 


and then setting tT =A=0. 


Prove by direct integration in ellipsoidal coordinates that the surface integral of oj, 
in Eq. (2.18), over the ellipsoid A, = 0, evaluates to 2C;, = Mm, where Mj, is the 
mass of the shell. Hints: From Appendix A.8, 
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——— 4 dv 
x Ja a Ja v)(v — a3)(v — a3)(a? 


yy(a3 — w)(u— 3) 
(2.71) 


where the factor 8 takes into account the degeneracy of ellipsoidal coordinates, the 
Lamé coefficients are evaluated at 2, = 0, and we changed variables as uy, = —m? LL 
and v, = —m?v. The double integral can be expressed in terms of complete elliptic 
integrals, and from eqs. (3.131.3-5), (3.132.2-5), and (8.122) in Gradshteyn et al. 
(2007), it evaluates to 277, completing the proof. 
Show by differentiation of Eq. (2.21) that the gravitational field produced at x by an 
heterogeneous ellipsoid, along the direction e;, is given by 

ap 


[oe] 
gi (x) = —-— = —27Gaja2a3xj i 
OX; 0 


p(mz) dt 


——_—_—_—., 2:12, 
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where m, = m(x,T) is defined in Eq. (2.22). 
Show that @(x) in Eq. (2.21) obeys the Poisson equation. Hints: Use Eq. (2.72) and 
the fact that for a generic function f(m-), where m; = m(x,T), we have (prove it!) 


a "og a Pate ae d 
Fed eS Pt Fe fo (2.73) 
Ox; My a? +T cha 2m, rar (a? +t) dm, 


With this exercise we derive the explicit integral formula for the potential of a trun- 
cated ellipsoid 


pr(m) = p(m) x O(m, — m), (2.74) 
where @ is Heaviside’s function in Eq. (A.99). Show that when x is outside the 


ellipsoid (i.e., S(x) > mt), Eq. (2.21) can be written as 


mt 


dt, av(r) =2 f p(t)tdt, (2.75) 


m(x,T) 


) x) = TA, a2a3 
Ar(x) V (tT) 


where A;(x) > 0 is the positive solution of Eq. (2.20) for m = m, while for S(x) < 
m, the formula above holds with A; (x) = 0. Finally, in analogy with Eq. (2.72), obtain 
the expression for the field g outside the ellipsoid considering that 


3 2 1 
OAt 2X; x; 
ee wes eC; (2.76) 
Oxi = Ayta? bp Gat =| 


Hints: As the ellipsoid is truncated, its mass is finite and we can fix x9 = oo and 
(Xo) = 0. Then, from the obvious identity 


2.9 
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[o,@) 
AW,(t) = 2 | or(t) tdt = AWV(t) x O[m — m(x,T)], (2.77) 
m(X, T) 
Eq. (2.75) is recovered. Equation (2.76) is proved by differentiation of Eq. (2.20) 
with m = m, and tT = A(x). 
Consider an ellipsoidal potential, 1.e., 


d=o(m), m= />z+at+>: (2.78) 
and show that 


1 1 1 x>  y* 2? \ hm) 
Ap =| ak ey | PUR) | aoe aS : (2.79) 
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ay ay m 


where h(m) = ¢'(m)/m and ¢'(m) = d¢(m)/dm. Can an ellipsoidal density distri- 
bution produce an ellipsoidal potential? What is the mass of a system associated with 
an ellipsoidal potential? Hint: Consider the monopole term in Eq. (2.35). 

From Egs. (2.23) and (2.75), show that the gravitational potential at interior points of 
a homogeneous ellipsoid is a quadratic form of the coordinates. Use the superposition 
principle to prove that the field in an arbitrary ellipsoidal cavity carved inside a 
homogeneous ellipsoid is a linear function of coordinates (see also Exercise 1.4). 
Show that the positivity of the homeoidally expanded density distribution in 
Eq. (2.29), obtained from a spherical parent distribution with dp/ds < 0, is guaran- 
teed for 


d\n f(s) 


1 
ee S deg, Am = sup 71 
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max(€,7) O<s 


(2.80) 


Prove that for a parent density profile 6 « s~7(1 + s”)~° one has Aq = a + be, 
so that Ay = 4 for the important family of ellipsoidal y-models in Eq. (2.83), with 
n < 1/3 in the axisymmetric oblate case (€ = 0) and 7 < 1/2 in the axisymmetric 
prolate case (€ = n). Hint: Rewrite Eq. (2.29) in spherical polar coordinates and 
recast the positivity request as 

€ sin? 0 sin? g + ncos? 8 


1>A 2.81 
= Am ieedy (2.81) 


over the closed region 0 < 3 < m and 0 < » < 27. What is the analogous inequality 
if mass conservation is not required, as indicated in Footnote 6 of this chapter (see 
Ciotti and Bertin 2005; Ciotti et al. 2020)? 

By using the general formula (2.21) for the potential of ellipsoidal distributions, 
shows that in Eq. (2.30) 


a 1 fe Aw@ . . 
(x) = if dt, t=-—, (2.82) 
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where all of the lengths are normalized to a; and AW = Aw /Pn is obtained 
from Eq. (2.22) applied to 6. By expansion for € — 0 and 7 — O, then recover 
Eqs. (2.29)—(2.31) and show that the Poisson equation is satisfied. 

2.13 From Eq. (2.28), show that the mass-conserving ellipsoidal generalization of the 
spherical y-models (Dehnen 1993; Tremaine et al. 1994; see also Chapter 13) is 
given by 


3-—y 
~ 4a (1—©)(1— n)my¥(m + IY” 


p(x) O0<y <3, (2.83) 


where M is the total galaxy mass, so that from Eq. (2.29) 
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p(x) ~ (2.84) 


Moreover, from Eq. (2.32), show that for the ellipsoidal Hernquist (1990) model 


(vy = 1), 

Io(s) = : 

o(s) = i= 
2+5 2In(1 +s) 

I(s) = , 

1(s) Sd 45) 3 (2.85) 
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while for the ellipsoidal Jaffe (1983) model (y = 2), 
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The formulae for generic values of y are given in Ciotti and Bertin (2005). Repeat 
the exercise for the model in Eq. (2.93), and for the ellipsoidal generalizations of all 
the spherical models in Section 13.2. 
2.14 By direct computation of the Poisson equation (spherical coordinates are suggested), 
show that the toroidal density 
R? ~ RR , r 
P=—z, KES r=, (2.87) 
re 


ry ry 


where 2 < a < 5and py andr, are density and a length scales, respectively generates 
the potential (obtained from homeoidal expansion in Ciotti and Bertin 2005) 
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R2 
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30) 6p 
2.15 Convince yourself that if a// of the multipole terms in Eq. (2.34) vanish up to the order 

n, then @n+1 is independent of the position of the origin of the reference system. Of 
course, this theorem is only relevant in electromagnetism (e.g., see Jackson 1998). 


b = 4Gpnr2 x (2.88) 


(a = 4). 


2.16 Consider a spatially untruncated density distribution. What happens to Eqs. (2.35) 
and (2.36) and to higher-order terms? Discuss the possible divergence of Q;; Hint: 
Also consider the expansion of ||/x — y|| for |ly|| > ||x|]| and compare it with the 
expansion in spherical harmonics in Eq. (2.51). 

2.17 Show that (when they exist) the quadrupole tensor Q;; and the inertia tensor 3;j are 
related as 


MQij = THSij)8ij — 33ij, Sj = [ Aly |?6i3 — vivpeyd?y. (2.89) 


Express 3;; and Q;; in terms of the second-order mass virial tensor J;; in Eq. (10.45) 
and show that 


. Tr(3;;) 
Sig = TrUij)dij — ij, Tij = a 


dij —Sij, MQij = 34ij — TrUij)6i;- 

(2.90) 
Deduce that Qij, %jj;, and J;; are all diagonal in the same reference system and find 
the relation between their eigenvalues. Note that Tr(3;;) = 2Tr(Ji;). 


2.18 Show that for the heterogeneous ellipsoid in Eq. (2.28), with finite mass M and finite 
extension (i.e., 0 = 0 form > m), 3;; and Qi; are in diagonal form, with 


2 2 2 2 2 
a,+a;, 2a; — a+ — a; 
Si = MAb, Qi =—— > —h, (2.91) 
where i #4 j #£k, and 
foe) CO 
h= / p(m)m*dm, / p(m)m2dm = 1. (2.92) 
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Under the convention aj > az > a3, deduce that O < 34, < S22 < 333 and 
O1; => Qo > Q33. Finally, use Eq. (2.35) to write the potential 6 = ¢o + ¢2 in 
spherical coordinates and discuss the shape of the isopotentials ¢ = c forc — 0. 
Hint: Solve a cubic equation for r(%, g,c), or (better) perform a standard asymptotic 
analysis to obtain r ~ ro(c) + 6r(3,¢,c). 

2.19 An important ellipsoidal density profile with peculiar orbital properties is the perfect 
ellipsoid (de Zeeuw and Lynden-Bell 1985): 
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p(x) (2.93) 
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Modify this density profile, and those of the ellipsoidal y-models in Eq. (2.83), 
introducing a truncation ellipsoidal surface m; so that o = 0 form > m,. Call M; the 
total mass inside m;, and for these truncated models calculate h in Eq. (2.92). 

By using Eq. (2.91), show that for a finite-mass, spatially truncated ellipsoid axisym- 
metric around the z-axis (i.e., a, = a2), O11 = Q22 = —Q33/2, and discuss the 
sign of Q33 as a function of flattening a3/a,. Show that in the equatorial plane the 
quadrupole potential at distance R from the origin is 


GM Q33 GM (a; — a3 
= ARs = 6R3 
Therefore, at radius R in the equatorial plane, the gravitational field obtained from 
obo + $2 of an oblate ellipsoid is stronger than that of a sphere of the same mass, and 
in turn this is stronger than that of a prolate ellipsoid of identical mass. 


Le (2.94) 


Show that the problem of the Green function for the Laplacian in Cartesian coor- 
dinates, with the separation of variables adopted in Section 2.4.1, leads to a one- 
dimensional Green problem of the family 


—~ — kG = A8(z — &3). (2.95) 


Determine the constant A. By using the method in Appendix A.7 and restricting to 
solutions so that G — 0 for |z| — oo, show that 
7 e~klz—831 


G =-A— —. (2.96) 


Hint: The associated homogeneous problem is a constant-coefficient differential 
equation. 

Consider a razor-thin density distribution 9 = o(7)6(&3), where n = (&1,&2). From 
Eq. (2.42), show that the gravitational potential (provided convergence is assured) 
can be written as 


etky)-klel 
$(x) = an ikl o(k)d"k, x= (x,y,7) = (Y,2), (2.97) 
Ht 


where ¢ is the Fourier transform of the surface density. Then show that the Gauss 
theorem holds. Finally, reobtain the same result by using Eq. (2.40) to compute the 
field above and below the plane and by imposing the validity of the Gauss theorem 
on the jump of the field through the plane. 

Show that the Green function for the n-dimensional Laplace operator in Cartesian 
coordinates AG = 6(x — &) can be obtained by direct Fourier transform: 


1 eithx—8) 
= d’k. 2.98 
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In case of 3, show the equivalence of the obtained expression with that in 
Eq. (2.42) by using the residue theorem of complex analysis by integrating over k3. 


2.24 
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Hints: Write G as a Fourier antitransform from Eq. (A.102), apply the Laplacian and 
obtain G(k, xo) by Fourier transforming the obtained identity, and finally resum the 
result. Notice that the integration path depends on the sign of z — &3. 


Show that the separation of variables for the Laplacian in spherical coordinates leads 
to the homogeneous equation for the function U(r) 


B 
rA,U =10+1)U, U=Ar' + aE (2.99) 
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Then consider the Green function and show that the procedure leads us to solve the 
one-dimensional radial Green problem 
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5 LL+1)Gim 
2: 


r Tr 


Determine the constant A. By using the method in Appendix A.7, using the result in 
Eq. (2.99), and finally restricting to solutions that are regular at r = 0 andr —> ov, 
show that 


Ar! 


45 _ < 
Gim(r) = es eal 


re =min(r,é), rs =max(r,é). (2.101) 


Hints: The radial equation for U is a second-order, linear equidimensional Euler 
equation that can be solved using standard methods (e.g., Bender and Orszag 1978; 
Lomen and Mark 1988). 


Reobtain Eq. (2.54) from the homogeneous solution in Eq. (2.49) by using the Gauss 
theorem. Hint: Fix a radius € and impose that the jump in the radial component of 
the field is due to a infinitesimally thin material shell. 


Consider the razor-thin density distribution o = o (€, y’)6(&3) and obtain the formula 
for the potential from Eqs. (2.61) and (2.62). Show that the obtained expression 
obeys the Gauss theorem. Then show that the same formula can be obtained from 
the homogeneous solution in Eq. (2.58) by evaluating the gravitational field above 
and below the disk and imposing the jump condition from the Gauss theorem. 


From Eggs. (2.53) and (2.54), show that the potential of an axisymmetric system in 
spherical coordinates p(r, i) can be written as 


oo co 4l 
(1,0) = —20G }Pi(cos 8) [ ara) §7dé, 


1=0 (2.102) 


Tw 
Ate) = [ p(e,9'VP1(c0s 9") sin’ dv" 
0 
where re = min(r,&), r5 = max(r,&), and (y is the the Legendre integral transform 


of o; then show that in the spherical limit Eq. (2.5) is reobtained. From the same 
system in cylindrical coordinates p(R, z), show that Eqs. (2.61) and (2.62) reduce to 
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[o.@) [o.@) , 
$(R,z) = -206 | Jo(kR)akf ee! 6(k, 2')dz’, 
0 


tics a =a (2.103) 
x , POKK, & ! 
k, — eee : Jo(k d ; 
Bie!) = OE [ p(E.z/Mol(kE)E dé 


where ((k,z’) is the zero-th-order Hankel transform of p according to Eq. (A.89) 
and (o(k, z’) is the zero-th-order Fourier—Hankel transform of p. 
Axisymmetric, razor-thin disks are idealized but very important astrophysical config- 
urations. By using Eqs. (1.18) and (2.102), show that in spherical coordinates, for a 
razor-thin disk with surface density &(r) in the equatorial plane, 


i u(E)P2(0) 
pa(é) = a parié) = 0, 
(2.104) 
$(r,8) = ~29G > Py (cos 9)P (0) ee ast DOE dg, 
1=0 
where re = min(a,r) and rs = max(a,r); then specialize the formula for the 


potential #(r,0). For the same disk, show that in cylindrical coordinates Eq. (2.103) 
reduces to 


(kz) = 6(2)E(k) = ae | X(E)Jo(kE)E dé, 

00 : (2.105) 
$(R,z) = -2nc | e213 (k)Jo(kR)dk, 

0 


and that particularly in the equatorial plane 


f(R,0) = -26 [~ LY(ko(kKR)dk = -46 [~ U(E)EK (=) ee (2.106) 
0 0 rs r 


= 


where re = min(R,&), r~ = max(R,é), and K is the complete elliptic integral of 
the first kind in Eq. (A.65). Hints: Use Eq. (A.78) to prove that in Eq. (2.104) only 
even terms appear. Equation (2.106) is established from Eq. (2.105) by setting z = 0, 
inverting the order of integration, and finally using eqs. (6.512.1) and (8.113.1) in 
Gradshteyn et al. (2007) to prove that 


/ Jo(ka)Jo(kb)dk = Pe (2) , a>b. (2.107) 
0 wa a 


By using Eqs. (1.17) and (2.102), show that for a homogeneous ring of radius a and 
total mass M, in spherical coordinates 


6(E — a)P2/(0 = 
woe éu+1(E) = 0, 


” 2 (2.108) 


o(r, 0) = —GM > Ps(cos 8 P21 (0) 45> ae 
1=0 


bu(é) = 
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where r=. = min(a,r) andr. = max(a,r). For the same ring, show that in cylindrical 
coordinates Eq. (2.103) reduces to 


de olka) 


6(k,z') = 
p(k, Zz) = 


o (2.109) 
0(R,z) = -cm | e*ElJg(ka)Jo(kR)dk. 
0 


Alternatively, obtain the same expressions by using Exercise 2.28 and Lying in 
Eq. (1.19). 

When observed face on, the stellar (surface) density distribution of disk galaxies can 
be qualitatively represented as (Freeman 1970) 


X(R)= Doe *, R= R/Ra, (2.110) 


where Rg is the disk scale length and Ry ~ 1.6784Rg is the half-mass radius. !2 
Calculate the total mass of the disk and then the quadrupole tensor, showing that 


Mg = 2n XRG, Q11 = Q22 =3R3, O33 = —OR}. (2.112) 


Evaluate the potential in cylindrical coordinates from Eq. (2.105) and show that in 
the equatorial plane 


P R R R R 
$(R,0) = —7GXpRaR c (5) Ky (5) I, @ Ko () |. (2.113) 


where I, and K,, are the modified Bessel functions in Appendix A.2.4. Repeat 
the exercise in spherical coordinates by using Eqs. (2.104) and (A.51). Hints: Use 
Gradshteyn et al. (2007). First, from eq. (6.621.1), prove that 


LoRF 
(1 + A2)3/2’ 


oo = A= kRa, (2.114) 
and insert Z(k) in the second identity of Eq. (2.105) with z=0. Consider 
eq. (6.552.1), differentiate both sides of the identity with respect to the parameter a, 
and set a = 1; the left-hand side is the desired integral and the right-hand side is 
reduced to its final form with the aid of Eqs. (A.92)-(A.94). Notice that Eq. (2.113) 
can be obtained in a different, clever way, as is shown in Eq. (2.164a) in Binney and 
Tremaine (2008). 


The half-mass radius can be expressed (do it!) in terms of the Lambert-Euler W function, a multivalued complex function 


defined implicitly as 


We =z, (2.111) 


(e.g., Corless et al. 1996). 
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Mestel (1963) introduced a truncated disk with the surface density in cylindrical 
cordinates 


= a) Ma = 27 LaRaRt, (2.115) 


where Xq and Rg are the surface density and length scales, respectively, 0(x) is the 
Heaviside function in Eq. (A.99), R; is the disk truncation radius, and Mg is the total 
mass. When Ri — ov, we obtain the so-called untruncated Mestel disk. By using 
Eq. (2.104), obtain the potential in the equatorial plane and show that 


Ri +> Px OP a 

r =r 2i+1 r : 

R, [Px (O)P | 42 +1 pe 
In—+1 ; 
va Ds U |Wet \k 


l=] 


d(r,0) = —27GuqRa x 


(2.116) 


where the first expression holds for r > R; and the second holds forO < r < R; 
(discuss the convergence of the series). Show that for r — oo and fixed R;, the 
expected expression of the monopole potential is obtained. Finally, discuss the behav- 
ior of the potential for the untruncated case (Rt — oo). Hint: For the treatment of 
the untruncated case, reread the discussion after Eq. (2.4) and use the second part of 
Eq. (2.116). 

Due to the conceptual importance of the Mestel disk, we now obtain the closed 
form of the potential in the disk by using cylindrical coordinates, and we show the 
equivalence with the expression in Eq. (2.116). Show that Eq. (2.106) for the Mestel 
disk becomes 


6 
$(R,0) = —4GE4Rq / K (=) a (2.117) 
0 


where 6 = R;/Ra, Pr = min(R,é), % = max(R,é), & = &/Rg, and R = R/Rg. 
Verify the equivalence with Eq. (2.116). Hints: The equivalence of Eqs. (2.116) and 
(2.117) is established thanks to the identity 
XK [o,@) 
= 2,21 
K(k) = 5 YS“ [Pa/(0)] Kk“, |kl <1, (2.118) 
1=0 
proved by series expansion of K(k) in Eq. (A.65) for |k| <1, according to 
Eqs. (A.49) and (A.50). Term-by-term integration, the use of Eqs. (A.53), (A.54), 
and (A.46) in this order, and finally comparison with Eq. (A.78) prove the result. 
Notice that the function 


B (0) = EoRake {JoR) + 5 WiG)Ho®) — Jo@Hi@)I}, A =kR, 2.119) 
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which is not used to obtain Eq. (2.117), is expressed in terms of Struve functions H, 
by using eq. (6.511.6) in Gradshteyn et al. (2007). 

Show that the potential of an axisymmetric system p(R,z) in cylindrical coordi- 
nates can be written in terms of a complete elliptic integral of the first kind K in 
Eq. (A.65) as 


= - PE, 2')dz’ ARE 
R,z) =—4G d K 
os i : ef. V(R + &)? + (Az)? Jaa -_ 


(2.120) 


where Az = z — Z’ (e.g., see Binney and Tremaine 2008); note that, at variance 
with the functions appearing in orthogonal expansions, the integrand is positive 
everywhere. Specialize Eq. (2.120) to the equatorial plane of the razor-thin disk in 
Eq. (1.18) and to a generic point of space for the ring in Eq. (1.17), and show that 


$(R,z) = a ais (2.121) 
an Reapee LV Rta +e | | 


Compute the radial force of the ring as a function of R in the z = O plane (see 


also Exercises 1.9 and 2.29). Hints: Due to rotational symmetry, compute @ at x = 
(R,0,z) and reduce integration over gy’ to the first quadrant with the duplication 
formula cosy’ = 2cos?(g’/2) — 1. Notice that the complete elliptic integrals in 
Legendre form can be expressed indifferently by using sin or cos in the integrand. 

In a nonrelativistic formulation, the field equation of modified Newtonian dynamics 
(the so-called MOND, a proposed alternative to dark matter; e.g., see Bekenstein and 
Milgrom 1984; Milgrom 1983; see also Sanders 2010) is 


div E (Rot) von | =4nGp, (x) = ——— 
ag , ca Jl +x 


where ap is a characteristic acceleration. In the strong (i.e., Newtonian) regime 
(x>1, uw ~ 1), Eq. (2.122) reduces to the Poisson equation, while in the deep 
MOND regime (x < 1, uw ~ x) 


(2.122) 


div (||V¢mll Vom) = 42 Gaop, (2.123) 


where the left-hand side is a p-Laplacian. Show that the MOND potential yy and 
the Newtonian potential @ of a given barionic distribution, are related as 
Vv 
»( Rent) Vom =Vo+h, h=VAAwm, (2.124) 
ao 
where Ay is a vector potential, and that in the spherical case the solenoidal field h 


can be fixed to zero. Deduce that in spherical systems the MOND and Newtonian 
fields are related as 
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GaypM 
Véull = Vaol Vol > Vou = oa? 2. (2.125) 


where M (r) is the mass of the distribution inside radius r. Conclude that the MOND 
potential for a point mass is logarithmic and that the rotation curve at large radii for 
finite-mass systems stays flat without need of a dark matter halo. For an application 
of homeoidal expansion to the construction of analytical nontrivial axisymmetric and 
triaxial MOND density—potential pairs, see Ciotti et al. (2006). 

Show that for a < 3 the Riesz potential produced by the density distribution p (x) 
with the additive force law in Eq. (1.13) is 


ete: ier) 2 
= — } ply)d’y, 
a —1 Jg3 \IIx—yll@-! [xo — yll¢—! 


$(x) — (Xo) = 
IIx — yll 3 
Gy] In-—— ply)d’y, (a = 1). 
x3 IIxo — yll 
(2.126) 
Then show that for the spherical homogeneous shell in Eq. (1.4) 
ie = po 
GM —1)@G-a)’ 
fie ee Oe (2.127) 
2Rr Rp r2 In(r_/a) — ee. Cea 
where rs =r + Randr_ = |r — R| (see Exercise 1.2; see also Di Cintio and Ciotti 


2011). Finally, discuss the case for r = R and show by differentiation that Eq. (1.14) 
is recovered. 


3 
Tidal Fields 


In astronomy in general, and in the study of stellar systems in particular, one is often 
led to consider the effects of an “external” gravitational field on a body of some spatial 
extension: examples are satellites around planets, binary stars, open and globular clusters 
in galaxies, and galaxies in clusters of galaxies. The general problem can be mathematically 
very difficult; however, when the extension of the body of interest is small compared to the 
characteristic length scale of the external gravitational field (i.e., when the system is in the 
tidal regime), the problem becomes more tractable. In this chapter, we provide the basic 
ideas and tools that can be used in stellar dynamics when dealing with tidal fields. Among 
other things, we will find that tidal fields are not always expansive (as in the familiar case 
of the Earth-Moon system), as they can also be compressive (e.g., for stellar systems inside 
galaxies or for galaxies in galaxy clusters). 


3.1 The Tidal Potential and the Tidal Field 


The basic idea behind the concept of a tidal field experienced by an astronomical body of 
finite extension in an external! gravitational field (e.g., a globular cluster in the field of the 
host galaxy) involves performing a Taylor expansion of the external gravitational potential 
at some place near or inside the object of interest and then retaining just a few terms in 
the expansion. As we will see, the second order is the optimal truncation of the potential: 
a higher-order expansion would be more precise, but the second-order expansion of the 
potential leads to linear tidal forces, with all of the mathematical advantages of a linear 
force field. We therefore start by considering the external potential #(x) produced by some 
external density distribution p(x) and calculated from some of the techniques presented in 
Chapter 2. 


! “External” should not be necessarily interpreted in a geometric sense, it simply means that the “external” system is not the 


direct object of our investigation. For example, a galaxy hosting a globular cluster will be called external if we are interested in 
the dynamics of the globular cluster. 
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Let xo be the position of interest in some inertial reference system So and let x = xo + &. 
By Taylor expansion, up to second-order inclusive, we have 


$x +8) =H 00+ <V9@)|, E> +7 07E) so(igir), an 


where the dependence of the various quantities on xg is made explicit for clarity. The 
second-order tidal tensor T is manifestly symmetric, and in Cartesian coordinates 


ao 
7 = ; a2 
Tij 0) = Fax, lean (3.2) 
for i, 7 = 1,2,3. The quadratic form? 
T. 9 
oT (§) = = (3.3) 


appearing in Eq. (3.1), is the (second-order) tidal potential: @7 is naturally associated with 
quadric isopotential surfaces, and due to its symmetry 7 can be diagonalized with some 
appropriate rotation of the reference system. 

Taking the gradient of ¢ in Eq. (3.1) with respect to &, we obtain the linearization of the 
external gravitational field for points near x9, with 


g(x) = —Vo ~ (xo) — T(Ko0)§, 8 (Ko) = —VO(X) aa (3.4) 


where in particular —7& is the tidal field along the displacement &: notice that the tidal 
field at fixed xq depends on the direction and length of &. Also notice that from Eq. (3.2) 
the (invariant) trace of J is none other than the Laplacian of the external potential ¢, so 
that the Poisson equation (2.6) dictates that at the generic point x 


Tr 7 (x) = 4 Gpc(x). (3.5) 


It follows that Tr 7 = 0 where p = 0, so at these points the eigenvalues of the 7 produced 
by p cannot all have the same sign, and as a consequence the tidal field is expansive along 
some direction and compressive along some other direction. However, for points with p 4 0 
(i.e., inside a galaxy or a galaxy cluster), the three eigenvalues may all be positive, the 
isopotential quadric is an ellipsoid, and the tidal field is compressive along all directions. It 
follows that the common expectation of an extensive tidal field is actually limited to quite 
special cases (e.g., a satellite orbiting a planet or a binary star), and also in these cases only 
along particular directions. 


3.1.1 The Tidal Field Produced by Spherical Systems 


In order to gain some confidence with the geometric properties of 7, we now study the tidal 
fields produced by spherical and ellipsoidal systems, two of the most common idealizations 
encountered in stellar dynamics. We begin by considering the case of a spherical system 


2 Note that T in Ciotti and Giampieri (1997) is defined as the opposite of T in Eq. (3.1), so that the quantities 7; in Eq. (3.10) 
are the same therein and in this book. See also Ciotti and Dutta (1994). 
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with density profile o(r), cumulative mass profile M(r), and potential #(r). It is a simple 
exercise (do it!) to show that at distance r = ||x|| from the center 


4rGp(r) (r)-—1 
Tyo) = E + ax OE] (3.6) 
where 6;; is the Kronecker symbol. Moreover, 
pir) 3M(r) — 327(r) 
— , = = ; 337 
q= 5D. \M= Tos =e (3.7) 


where p(r) is the average density, within a sphere of radius r, of the body producing the 
field, and the second identity relates the average density at r and the angular velocity Q(r) 
of circular orbits at r (see Chapter 5). In the exercises, the explicit form of the function 
q(r) associated with some of the most widely used models of spherical stellar systems is 
derived; here, we simply note that outside a spherical body (and specifically for the case of 
the tidal field produced by a point mass), g(r) = 0, and that for monotonically decreasing 
density profiles (the situation in almost all astrophysical systems), g(r) < 1. 

Without loss of generality, we can obtain a simple geometric interpretation of 7 consid- 
ering an arbitrary point xp = (r,0,0), so that from Eqs. (3.6) and (3.7) 


2 Gago. 0 3q-2 0 0 
4G 
T= ee 0 1 0J/=2~[ 0 1 Of. (3.8) 
0 oO1 Oo oO ft 


Notice how in the present case Eq. (3.5) is of immediately verified. From Eq. (3.3), the tidal 
potential at r is given by 


(39) — 2p +E +8 

5 , 
so that the tidal isopotential surfaces are rotationally symmetric about the & axis (i.e., the 
radial direction) and reflection symmetric with respect to the €; = 0 plane (i.e., the plane 
tangent to the spherical surface of radius r). For 2/3 < q < 1 the surfaces of constant @7 
are prolate ellipsoids with the major axis along the radial direction, and the resulting tidal 
field is compressive all around x. The limit case g = | corresponds to spherical 67, while 
for g = 2/3 the equipotentials are cylinders with axes along the radial direction &), and 
the tidal field is (cylindrically) radially compressive toward these axes, with no tidal forces 
along &). Finally, for0 < gq < 2/3 the coefficient of & is negative; the isopotential surfaces 
are two-sheet and one-sheet similar hyperboloids of revolution around the radial direction 
for negative and positive #7 values, respectively. The two families are separated by the 
o7 = 0 cone, the opening angle of which gets smaller as g increases. The associated tidal 
field is expansive along &, and radially compressive in the perpendicular plane (&2,&3), a 
familiar situation occuring, for example, in the Earth-Moon system. 

We conclude this preliminary section by noticing an important and quite general prop- 
erty of g(r) as a function of the density profile o(r). In fact, the cases in Eqs. (3.58) and 
(3.59) show that the transition from the region where g(r) < 2/3 to g(r) > 2/3 (corre- 
sponding to a transition from a radially expansive to a radially compressive tidal field) will 


or (é) = 27(r) 


(3.9) 
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be experienced by a stellar system along its orbit in a host system when entering the shallow 
density “core” of the external density distribution p(r). For example, a globular cluster in 
radial orbit inside an elliptical galaxy or crossing the galactic disk of a spiral galaxy will 
be subjected to repeated compression and expansion phases, with significant structural and 
dynamical consequences (for a detailed discussion of the dynamical phenomena involved, 
see, e.g., Binney and Tremaine 2008; Spitzer 1987). 


3.1.2 The Tidal Field Produced by Triaxial Systems 


We now extend the previous discussion to the case of the tidal field produced by an ellip- 
soidal density distribution (e.g., a triaxial elliptical galaxy or a triaxial dark matter halo) 
at a generic point of space. We assume that the density is stratified on homeoidal surfaces 
p = p(m) as in Eq. (2.28), with the origin at the center of the distribution and axes parallel 
to the axes of the reference system. Moreover, we adopt, without loss of generality, the 
same convention as in Eq. (2.27) (1e., gz = a3/a1 < qy = a2/a, < 1). The associated 
potential is given in Eqs. (2.21) and (2.22) and the general expression of 7j; at a generic 
point x is given in Eq. (3.60): it follows that at the center of a sufficiently regular density 


distribution 


TE? + Th&> + T38Z 


5 : (3.10) 


TiO) = 2xGpO)wi =—-Ti, oT = 


the dimensionless factors w; are given by 


w; = ayana / ~ a (3.11) 
Pe” So G+ DVR | 

and finally 7;;(0) = 0 fori # j. Note that with 7; we indicate the opposite of the element 
Ti; when 7 is in diagonal form, so that from Eqs. (3.4) and (3.10), the tidal field at € near 
the center is (7&1, 72&2, T3&3). Note also that the specific form of the density distribution 
does not enter Eq. (3.11), and that with the adopted convention of a; > az > a3, we 
have wy < wo < w3 and0 > JT, > To => T, so the maximum (in absolute value) 
compressive force is along the short-axis direction of the density p(m). Finally, again from 
Eq. (3.60), we conclude that from a constant-density ellipsoid the tidal field obtained above 
actually coincides for all points inside the ellipsoid with the gravitational field produced by 
the ellipsoid, because from Chapter 2 we know that the potential inside a constant-density 
ellipsoid is a quadratic function of coordinates. 

In applications, it is quite useful to have the explicit expressions of the w; functions, and 
with some work it can be proved that 


F(g,k) — E(@,k) 
k2 sin? p , 
E(g,k) — Kk? F(y,k) — k*(qz/dy) sing 
k2k/2 sin? p , 
(qy/qz) sing — E(g,k) 
k? sin? p , 


wWyp= 24y4z 


w2 = 24y4z (3.12) 


W3= 24y4z 
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where y = arcsin ,/1 — q?, k? = (1 — q2)/(1 — q2), k? = 1 —’, and finally F(g,k) 
and E(¢,k) are the elliptic integrals of the first and second kind in Eqs. (A.62) and (A.63), 
respectively (see also Byrd and Friedman 1971; Gradshteyn et al. 2007). Finally, as is 
shown in Eqs. (3.62) and (3.63), for axisymmetric systems the functions w; reduce to a 
combination of elementary functions (see also Binney and Tremaine 2008). In Chapters 10 
and 13, we will again encounter the functions w. 

Admittedly, the formulae in Eq. (3.12) are not very illuminating; however, a qualitative 
understanding of the dependence of the functions w; on the flattenings can be achieved 
by considering the series expansion near the spherical limit. By setting gy = 1 — € and 
dz = 1 — 7 (see Section 2.2.1), with 0 < € < n < 1, itis a simple exercise (do it!) to show 
that the series expansion of Eq. (3.11) reads (see also Exercise 3.10) 


1 
wp ~ 2d -e( (5+), 


5 
1 
made n(5+ ==). (3.13) 


1 3 
‘i De n(5+S “). 


Of course, the same result can also be established (with substantially more work) from the 
series expansion of Eq. (3.12). Note that from the adopted choice of the axis labeling, the 
oblate case (with the short axis along z) corresponds to € = 0, while the prolate case (with 
the long axis along x) corresponds to € = 7. 


3.2 Rigid Bodies in Tidal Fields 


We now consider a few consequences of astrophysical interest obtained under the (more 
or less justified) assumption that the stellar system experiencing the external gravitational 
field behaves as a rigid body, with density distribution p,. and total mass M,, in motion in 
the tidal field. This approximation may appear unreasonable for a system made of stars, 
but in fact in certain circumstances the predictions compare quite well with the results of 
N-body simulations (for a more detailed discussion, see, e.g., Ciotti and Dutta 1994; Ciotti 
and Giampieri 1997; Muccione and Ciotti 2004; see also D’Ercole et al. 2000 for tidal 
effects on gas flows in elliptical galaxies). 

To set the stage, we consider an inertial reference system So and the time-independent 
density distribution p(x) producing the external field g = —V@(x), as well as the asso- 
ciated tidal field. We assume that the characteristic dimensions of p, are much smaller 
than the typical length scale over which the external potential changes, and we indicate 
with Rom = Soa xp,d°x/ M,, the instantaneous position of the center of mass of p,. From 
classical mechanics, the acceleration of Rcm is given by (see also Chapter 6) 


M.Acm = [ y 2(X) px(x; t)d°x ~ M,2(x0) — T (xo) [ 3 — x0) px (x; t)d?x 


= M,, [g(xo) — T (x0) (Rem — Xo)] = M.g(Rcm), (3.14) 
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where the possible time dependence of p,, in So (e.g., due to a change of orientation) has 
been made explicit. In Eq. (3.14), the first integral gives the full and exact contribution 
of the external field, the second and third expressions hold according to Eq. (3.4) for 
the linearized external field around the generic point xo, and the last expression is finally 
obtained for x9 = Rcm. From the first identity in Eq. (3.14), it is clear how Acy in general 
depends on the system extension and orientation, so that translations and rotations of 
extended bodies in an external field may be coupled in a very complicated (i.e., nonlinear) 
way. However, from the last identity in Eq. (3.14), it also follows that, at the second order 
in the tidal field, the motion of Rc» is not affected by the shape and orientation of p, when 
the external potential is expanded around Rcm, and so under this approximation we can 
decouple the motion of Rcm from the orientation and study the translation of the center of 
mass of Rcmw as the motion of a point mass M,, in the external field g. 

The second relevant identity of classical mechanics concerns the angular momentum of 
the system computed with respect to the reduction point xo (see again Chapter 6) 


d 
J= / (x — Xo) A V(X, t) 0x (X; t)d>x, a = M,Vcom A vo +N, (3.15) 
x3 
where Vom = dRcw/dt and vo = dxq/dt are the velocities of the center of mass of p, 
and of xo, v is the velocity of the distribution p,, at each position, and finally 


N= [.« — xo) A g(x) px (x; t)d3x ~ — [« — xo) A [T (Xo) (x — x0) ]ox(x; 1)d?x, 
| | (3.16) 


which is the torque? due to the external field in its exact and tidal expansion, respectively. 

Equipped with these important identities, we now consider two astrophysically inter- 
esting applications of the tidal field approximation: namely, the study of the motion of a 
triaxial galaxy at the center of a triaxial cluster and of a triaxial galaxy in circular orbit 
around/inside a spherically symmetric density distribution. 


3.2.1 A Triaxial Galaxy at the Center of a Triaxial Cluster 


In this section, we consider the simple case of a triaxial galaxy, modeled as a rigid body of 
density profile po, and total mass M,., with its center of mass Rcy at rest at the equilibrium 
point xo = O of an external potential (e.g., at the center of a triaxial galaxy cluster that 
is sufficiently regular near the origin; that the origin is a point of equilibrium is proved 
immediately by Eq. (2.72)). Therefore, Eq. (3.14) shows that Rcm will remain fixed in the 
origin of So, and only rotational motions of p, around xo = 0 will be possible. Finally, near 
the equilibrium position we have x = &, and Eqs. (3.1) and (3.4) reduce to 


T (0)x, 
b(x) ~ 6(0) + = a(x) ~ —7 (0)x, (3.17) 


where without loss of generality we can assume that 7 is diagonal in So. 


3 Actually, Eqs. (3.14)—(3.16) hold for a generic density distribution p,., not necessarily a rigid body, as can be proved from the 
general equations of fluid dynamics; see Eqs. (10.70) and (10.71). 
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A natural question of obvious astrophysical interest relates to the existence of equilib- 
rium configurations of ,. in the tidal field. Mathematically, this is equivalent to searching 
for the vanishing of the tidal torque in Eq. (3.16), which reads 


0g (x) 
Ni = ~eix f 1 pate a?x fo =eixTaO) [ X jX] P(X; t)d>x, (3.18) 
n 


m3" OX 


where €;;x is the Levi—Civita tensor and the integral in the last expression can be expressed 
in terms of the tensor Jj; appearing in Exercise 2.17. Using Eqs. (2.90) and (3.10), the 
antisymmetry of €;;<, and the symmetry of the inertia tensor S of p,, it is a nice exercise 
(do it!) to show that 


N, = 923A7T3, N2 = 93,AT31, N3 = S2ATi2, (3.19) 


where AT;; = T; — T; is a manifestly antisymmetric tensor; if the external potential and/or 
ps are spherically symmetric (and so A7T;; = 0 and/or 312 = 313 = 323 = 0), then no net 
torque acts on the stellar system and all of the orientations of p, are at equilibrium. What 
happens in the more realistic case of a nonspherical p,. in a nonspherical external potential, 
when AT;; 4 0 fori 4 j? From Eq. (3.19), it follows that p, is in an equilibrium as a rigid 
body if and only if 3;; = 0 fori # j (.e., if and only if the principal axes of p, are aligned 
with the axes of the cluster distribution (the axes of So)). In practice, a “rigid” triaxial 
galaxy (modeled as a rigid body) at the center of a triaxial cluster will be in a configuration 
of equilibrium when its axes are aligned with the cluster axes. 

We can now move to study the stability of the equilibrium positions and, for stable 
positions, to determine the oscillation frequencies of p,,. Let S’ be an orthogonal reference 
system with its axes directed along the axes of inertia of p, and with the origin coincident 
with the origin of So (and with Rey). Moreover, let R(t) be the orthogonal transformation 
matrix between S’ and So, so that x = Rx’, where x’ is a generic position vector in S’ (see 
Appendix A.1.1). In S’ the inertia tensor 9’ of ¢, is diagonal by construction, and we define 
I; = 3}, (no sum intended); for simplicity, from now on we consider only nondegenerate 
cases (ie., 1) #4 Ig ~ Ih and T; 4 Th # T3). Instead of using the usual representation 
of in terms of Euler’s angles (i.e., the successive combination of the 3-1-3 rotations,* 
where the number indicates the axis of the system around which the specific rotation is 
performed), it is convenient to adopt the counterclockwise 1-2-3 rotations (e.g., see Ciotti 
and Giampieri 1997): 


R= RiGR27)R3(W), (3.20) 
with 
1 0 0 
Ri(g)=|0 cose —sing], (3.21) 


0 sing cos@ 


4 This choice would complicate the treatment due to the indeterminacy of the angles yg and y for a null inclination (0 = 0). See 
also Eqs. (11.46)—(11.48) for a different choice of R as a 3-2-3 rotation. 
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cosv O- sin? 
Ro(0) = 0 1 0 : (3.22) 
—sin? O cost 


cosw —siny 0 
R3(vW)=|siny cosw OO}. (3.23) 
0 0 1 


In particular, from Eqs. (3.20) and (A.26), it is easy to prove (do it!) that the angular velocity 
of p, expressed in the reference system S’ is (see Footnote | in Appendix A.1) 


@ = oy + RA)loy + Rz(O)oy] 
= (gcosd cosy +d siny, — pcos? siny + cosy, gsind + W), (3.24) 


where @®y = (¢,0,0), @» = (0, 3,0), and wy = (0,0, w) are the three angular velocities 
associated with the three rotations and the suffix “T”’ means “transpose.” 

It should be obvious that, apart from a renaming of the inertia axes, the equilibrium posi- 
tions previously determined are given, without loss of generality (see also Exercise 3.7), by 
(9,0, W)eq = (0,0,0), and we now study the motion of p, around this configuration. We 
then construct the Lagrangian £ = T — U of the problem (e.g., Arnold 1978; Goldstein 
et al. 2000; Landau and Lifshitz 1969) by using the three angles as generalized coordinates. 
It is not diffcult to show (see Exercise 3.8) that the potential energy of p,. in the external 
potential can be written by using repeated indices as 


U= : P(X) Px (X; t)d°x ~ M,¢(0) + tGpe(O)Tr(S) + se (t)TjI,- (3.25) 
x3 


The first two terms on the right-hand side are constant, and so they do not affect the 
equations of motion of p,. In the same exercise we also obtain the explicit expression 
for the kinetic energy T of p,. The linearized potential and kinetic energies for small 
displacements from the equilibrium position (¢, 0, W)eq = (0,0, 0) are easily obtained from 
Exercise 3.9 (do it!) as 


Qh. Ih. Iz 
T~ 59 + so st 
(3.26) 
AT327AI327_ y= AT31;AI31 .5 = AT21 Ala a 
p 0 uw, 
2 2 2 
where AJ;; = J; — I; is the antisymmetric tensor built with the principal components 


of 3 and A7;; is defined in Eq. (3.19). Therefore, from the Lagrange—Euler equations 
d(dL/dqi)/dt = dL/dq;i, the linearized equations of motion for the galaxy near the 
equilibrium configurations are given by 

re AT32 Al32 “ AT3, Als, “ AT, Aly 

Q= Y, v= v, wv = 

fy lh b 

Following Eq. (3.10), let us assume, without loss of generality, thatO > 7; > 72 > T3 (as 
obtained for an ellipsoidal cluster distribution with a; > a2 > a3;1.e., that A732, AT3), and 
AT», are all positive quantities). From Eq. (3.27), stable motions (librations) are possible 


Vv. (3.27) 
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if and only if ) < h < J; (ie., according to Eq. (2.91), when the longest axis of px 
is aligned with the long axis of the triaxial cluster distribution, the intermediate with the 
intermediate, and the short with the short). In summary, a stable equilibrium configuration 
(in the nondegenerate case) is achieved when: 


(1) The principal inertia axes of p,. are aligned with the principal axes of the external tidal 
field. 
(2) If T; => Th => T3, then ) < hb < ih. 


Near the stable equilibrium positions, the solution of Eq. (3.27), corresponding to max- 
imum angular displacement and zero velocity at t = 0, is given by 


y(t) = pm cos(27 Vogt), V(t) = Pu cos(27 vet), W(t) = ym cos(27 vy Ft), (3.28) 


where the libration frequencies and associated periods are given respectively by 


1 ATx3 A132 J AT>3 (1 — v) 1 
Vo — — > Po =—, 
Qn i Qn Ju Vo 
1 AT,3 AL AT\3 1 — 1 
— BABI _ v 13 ( i) bes (3.29) 
Qn Ih Qn /v Vy 
1 AT)2AIn J AT\2 (v — u) 1 
Vy —i = > Py =>—, 
20 i 20 Vy 
and 
fF I 
“= = v= 2. (3.30) 
Is 3 


Note that only the ratios of the inertia moments of 0, appear in the expressions above, and 
that the stable equilibrium condition corresponds to the inequality u < v < 1; as expected, 
in the spherical limits, the libration frequencies vanish. 

Some insight into the dependence of the libration frequencies on the cluster and galaxy 
flattenings can be obtained by considering the limit of small flattenings. First, we model 
the cluster as a triaxial ellipsoidal density distribution with semiaxes a, > a2 > a3, so that 
the tidal field is given by Eqs. (3.10) and (3.11) and 


AT;j = —21 Gp (0)(wi — wj), (3.31) 


where (0) is the cluster’s central density. Second, for the galaxy density we assume that 
Px = px(m) in the system S’, with m given in Eq. (2.28), and semiaxes a, > a2 > a3, 
so that 


=l1-¢«, ~=1-n, €& < <1, (3.32) 
wal onl 


are the corresponding ellipticities. In S’, the galaxy inertia tensor is diagonal, with the 
explicit expression for Jj = Sj; given in Eq. (2.91) so that ) < Ih < 43; notice that 
the ratios u and v do not depend on the specific density profile p,(m), but only on the 
ellipticities, because the shape function cancels out. In Exercise 3.10, the student is 
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guided toward obtaining the leading term of Eq. (3.29) in the limit of small cluster and 
galaxy flattenings. The resulting formula will be used in Chapter 10 where, with the aid 
of the virial theorem, we will be able to estimate the libration frequencies, and we will 
discover that they can be of the same order of magnitude as the orbital periods of stars in the 
external regions of elliptical galaxies. In Section 3.3, we briefly discuss the possibility that 
associated resonances significantly affect the stellar orbits at the galaxy outskirts, leading 
to what can be called collisionless evaporation. 


3.2.2 A Triaxial Galaxy in Circular Orbit in a Spherical Cluster 


We now consider the more complicated case of a triaxial galaxy (again modeled as a rigid 
body of density distribution p,.) with the center of mass Rc on a circular orbit inside 
a spherically symmetric potential produced by a density distribution p(r). This situation 
could represent the idealized case of an elliptical galaxy orbiting inside a galaxy cluster. 
Note that the assumption that Rcy remains on the initial circular orbit independently of the 
orientation of p, is justified by Eq. (3.14); of course, the rotation of Rey does affect the 
equilibrium positions and oscillation times of the galaxy near stable equilibria. 

To set the stage, we assume without loss of generality that the orbit of Rcy is in the 
(x, y) plane of the inertial system Sg centered on p(r), as in Figure 3.1. Therefore, in So, 
the galaxy center of mass Rcm rotates with constant angular velocity 

2= 0,00, = 22). 3.33) 


r3 


r=Rcm 
where M(r) is the cumulative mass of the cluster density distribution inside the sphere of 
radius r. As is shown in Figure 3.1, the natural frame of reference for our problem is the 
noninertial reference system Scm, with its origin placed at Rcm, the e} = Reyw/||Rcm|| 
axis directed along the cluster radial direction, and e3 parallel to angular velocity (i.e., the 
z-axis of Sq); the direction of e» is finally fixed so that (e;,e2,e3) are positively oriented. 
We indicate with x and y the generic position vectors in Sg and Scm, respectively, and 
from Eq. (A.25) 


cosQt —sinQt 0 
x=Reowmt+Cy, C=] sinQt cosQr Of], (3.34) 
0 0 1 


where without loss of generality we assume that at the arbitrary time t = 0, Rcy(0) is 
aligned with the x-axis of Sp, and so Sg and Scyy are coincident with C(0) = J. It follows 
that in Scm 


oom =C'2=2, Tem =C'TC, (3.35) 


where cy is the angular velocity of Scm as observed from Scy, 7cm is the cluster tidal 
tensor at Rcm expressed in the system Scm and so given by Eq. (3.8), and finally 7 is the 
tidal tensor in the system So. 
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Figure 3.1 The definition of the three references systems needed to determine the equilibrium 
position of a triaxial body in circular orbit inside a spherically symmetric potential. Sp is the inertial 
reference system centered on the spherical distribution producing the tidal field, Scyy is centered 
on the center of mass of the triaxial body px and rotating on a circular orbit of radius Row, and 
finally S’ (dashed axes) is the noninertial reference system fixed in p, where the inertia tensor is in 
diagonal form. 


We can now start with a discussion of the possible equilibrium positions of p, in Scm 
as follows: we consider Eqs. (A.33)-(A.36) with x9 = Rcm, R = C, ® = @cm, and where 
we define J = Jo, N = No, the angular momentum and the torque in Sp with reduction 
point Rem. Finally, we set 3’ = 9, J! = J, and N’ = N, the angular momentum and torque 
in Scm, so that 


a [, YAYpxy.t)d’y, N=- [ y A (Tey) px(y.td°y. (3.36) 


We can now evaluate the Newton’s equation Jo = No in So, and after easy algebra (do it!), 
we obtain that in Scm 


J=N Se@cm 3@cM — @cm A J — @cm A S@cM, (3.37) 


where % is the (in general time-dependent) inertia tensor of p,. In Scm, the equilibrium 
positions of p, are then determined by the condition that J = J = 3 = O, along with 


60 Tidal Fields 


the condition (verified by hypothesis) that acy = Q and wcy = Q = 0. At equilibrium, 
Eq. (3.37) reduces to 


N-Q2QA32=0. (3.38) 


As for the case in the previous section, we now write Eq. (3.38) in components. We begin 
with the tidal torque N. It is clear that Eq. (3.19) also formally holds for the torque in Scm, 
where Eqs. (3.8)—(3.10) show that 


AT}3 = ATi. = 3(1— gq)” = 0, AT = 0; (3.39) 


note that the inequality above follows from Exercise 3.1. Moreover, Q A SQ = 
Q?(—393, 313, 0), so that the equilibrium condition in Eq. (3.38) finally reduces to 


Sy2 = 313 = 923 = 0. (3.40) 


In practice, equilibrium requires that the off-diagonal components of 3 vanish in Scyw (i.e., 
that one inertia axis of , points radially toward the cluster center and another is directed 
along the angular velocity direction). Therefore, without loss of generality, the equilibrium 
configurations can again be expressed algebraically as (¢, 0, W)eq = (0,0,0). 

We now determine the stability of the equilibrium positions, and for the stable positions 
we compute the oscillation frequencies by linearization of the Lagrangian function (in the 
system So) £ = T — U near the equilibrium configurations. As in the previous section, this 
requires the introduction of an additional reference system. Let S’ be the reference system 
centered on Rcm whose axes are the principal axes of ,. (see Figure 3.1) and let R be the 
transformation matrix between Scyy and S’, again defined as in Eq. (3.20) (ie., y = Rx’), 
so that the total rotation matrix between Sp and S’ is given by CR and x = Rom + CRx’. 
The potential energy in So is given again by Eqs. (3.25) and (3.26), where the expansion 
point is now Rcm, while the kinetic energy in So, expressed by using the quantities in S$’, 
is obtained from Eq. (A.38) that in the present case reads 

lots 
T=M, Bor ae a or = R'L + @, (3.41) 
where 3’ is the diagonal intertia tensor of p, in S’, wy is the angular velocity of S’ with 
respect to So (prove it!), and @ is the angular velocity of S’ with respect to Scm given again 
by Eg. (3.24), while the explicit expression of R'@, the angular velocity of Sc as seen 
from S’, is finally given in Eq. (3.72). 

The linearized potential and kinetic energies for small displacements from the equilib- 

rium position are easily obtained (do it!), and the resulting differential equations are: 


Q2AL Qh+h—-b)-: 
2 (+h oy 


if qi 
Q? + AT;3)AL Qh +h-t 
oc ( a 13) al 5 ( ieamee 9) 3) 6 (3.42) 
bh lh 
. AT»; Aly 
y = ——-v. 


i 
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The w equation immediately provides the first condition for a stable equilibrium, by 
virtue of A7T>; < 0 from Eq. (3.39), so that Jy > I. Then (see Exercise 3.12), the g and 
v0 equations can be separated into two identical linear fourth-order equations with constant 
coefficients, which are solved by setting 


g() = Xi", oe) =Ye™. (3.43) 
The resulting biquadratic characteristic equation for w is 
ow _ aw + B = 0, (3.44) 


where 


— 1) = _ = 
a= (S* 1) wed v8 3q)(1 2) «9 


uv u Vv 


(3.45) 


26-30 ==) 


uv 


B of, 
u and v are given in Eq. (3.30), so that solutions of Eq. (3.44) can be written as w = 
@(u,v,q) X Q, where @ is a dimensionless function, and the dependence on r is contained 
in Q and q. 

Stability requires that the four solutions to Eq. (3.44) be real (i.e., that the two solutions 
A+ of the associated quadratic equation are real and positive). For reality 


A =a’ — 4B > 0, (3.46) 
and for positivity Descartes’ rule of signs dictates that 
a>0; B>O. (3.47) 


Note that the conditions (3.46) and (3.47) are formally equivalent to the single condition 
a > 2,/B; of course, the numerical verification of stability for assigned values of u and v 
is trivial, while in the following we discuss the problem from the analytical point of view. 
The positivity of 6 in Eq. (3.45) is easily addressed, because 4—3g>1 from 
Exercise 3.1, so that we are left with the discussion of two possible stable configurations: 


Qu<v<1sh>h>h, 
(2) 1<u<v3Sh>]>h, 


where the stability of yw motion (2 > J); 1.e., v > u) has already been assumed. 

Case (1) is easy to discuss: in fact, foru < uv < 1, the coefficient a is a sum of 
positive quantities, and some algebra (do it!) shows that A can also be written as a sum 
of positive quantities. Thus, the motion near the equilibrium position is stable for I, > 
In > I, (see Figure 3.2) (i.e., for the minor axis of p, perpendicular to the orbital plane, 
for the intermediate axis directed tangentially to the orbit, and for the major axis directed 
toward the center of the external potential). 

Case (2) is far more interesting, but its analytical (algebraic) discussion is not easy and 
is quite long, and so it is not reported here. A detailed study of the conditions in Eqs. (3.46) 
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Figure 3.2 The parametric stability plane for the case of a triaxial galaxy in circular orbit about the 
center of a spherical cluster. The triangular region extending up to u = 1 and delimited by the vertical 
dashed line and by the line wu = v corresponds to the Case (1) stable configurations. The Case (2) 
stable region is bounded at the left by the vertical line vu = 1, at the right by the curve v = v+(u), and 
at the bottom by the line uv = v with 1 < u < uc; it is separated into two subregions by the vertical 
long-dashed line at vu = ua. In the figure, we adopted g = 1/2, so that ug X 1.15 and ue & 1.29. 


and (3.47) shows that for 1 < u < v (i.e., when the major axis of p,, is perpendicular to the 
orbital plane, the medium axis is directed toward the center, and the short axis is parallel to 
the orbital velocity), the condition a > 0 is verified once A > 0. Moreover, the stable 
region of Case (2) (see Figure 3.2) is made up of two parts separated by the vertical 
asymptote at 


——-. (3.48) 
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with uz — oo for g — 1. In the region 1 < u < uy there is no upper bound on J)//3. 
For gq < | there exists another stable region spanning the range ua < u < Uc, whose lower 
bound is again represented by v = u, while the upper bound on J)/J3 is given by 


14+90 —q)u-—6(01 - qyue + 6(1 — q)uJu(u — 1)(4 — 3q) 
120. — q)u? — 1201 —q)u-1 


The point of intersection between v,(u) and v = u is obtained by solving a fourth-degree 
equation, and it is given by 


wu =(u-) . (3.49) 


uc(q) = 77 5 3 —4)(5 +39) (3.50) 
A very important case is obtained for gq = 0 (i.e., in the case of a triaxial body orbiting 
outside a spherical body): in this case, the Case (2) region is very small, with u,(0) = 
1/2+ 1/V3 ~ 1.08 and u.(0) = 1/2 + 35/10 ~ 1.17. 

Having found the criteria for stable equilibrium, we now focus our attention on the 
associated libration frequencies. From the last differential of Eq. (3.42), the frequency for 
the y% motion is given again by the last differential of Eq. (3.29), where now 


1 {ATAL Q 1 
ry = <= 5—Vkw—w), k=30-9), Py=—, 51) 
Qn Lb Qn Vy 


1 | [ee ene AT 


and u and v are defined in Eq. (3.30). From Exercise 3.12, the g and ? librations result 
from the superposition of two independent oscillations, with frequencies 


alk 2 2: = 
ss afi 2h (EES See (t +k)2 — 20(t + 2kt —k) (3.52) 


Vo = ’ 
ge Or 2 


c= : t= : (3.53) 


Notice that the presence of the coefficient Q indicates that the periods of libration are of 
the same order as the orbital period of the galaxy around the cluster center. The student is 
invited to compute these times for some representative galaxy and cluster models, repeating 
the approach of Exercise 3.10, and using Exercises 10.20 and 10.21. 
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From the analysis of the two highly idealized cases in the previous sections, we concluded 
that the libration times of galaxies around their equilibrium configurations in the tidal field 
of the parent cluster can be of the same order of magnitude as the average stellar orbital 
times at the galaxy outskirts, and this makes it interesting to investigate in more detail the 
responses of such stellar orbits to periodic variations of the external (tidal) field. It is not 
unreasonable to expect that some fraction of the galaxy stellar population will “resonate” 
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with the galaxy libration periods and may be lost in the cluster intergalactic space: perhaps 
some fraction of the so-called intracluster stellar population (e.g., see Gerhard et al. 2005) 
is produced by such collisionless evaporation (e.g., see Muccione and Ciotti 2004 and 
references therein). 

The problem of stellar motions inside a galaxy with density distribution p, in a cluster 
can be described in the highly idealized framework adopted in this chapter. We call So the 
inertial reference system with its origin in the cluster center, and we consider the noninertial 
reference system S’ with its origin in the galaxy center of mass and axes aligned along the 
principal axes of the galaxy inertia tensor. Each star moves under the action of the physical 
force produced by the galaxy itself (g,.) and by the cluster (gext), so that from Eq. (A.29) 
we obtain that in S’ 


® = R1(g. + gext — Acm) — 2@7 AX’ — ot Ax’ — oT A (@T AX), (3.54) 


where x’ is the position of a star in S’, Ry = CR is the total rotation matrix between So and 
S’, and wy is the associated angular velocity of S’ as given in Eq. (3.41). From the previous 
discussions, it immediately follows that at the second-order tidal expansion, the physical 
force field in S’ is 


RA(Sx + Bext — Acm) ~ —Vurh, — T'x’, (3.55) 


where Vy’ is the gradient operator in S’, T’ = R17 Rr = R' Tem is the dependent tidal 
field in S’, and finally 7 is the tidal tensor expressed in Sp and Tom in Scm. Quite obviously, 
the two cases of galaxy libration considered in the previous sections are contained in 
Eq. (3.55). 

Consistently with the adopted “small oscillations approximation,” the linearized rotation 
matrix 7 and angular velocity wy are obtained from the expansion of their full expressions 
up to the linear terms in gy, Ym, and ym, so that (prove it!) 


1 -w ¢? 
R~lv 1 ol], or~ @-82,8+ 92,0 +Q), (3.56) 
v0 @ 1 


where the two cases of a triaxial galaxy at the center of a triaxial cluster and of a triaxial 
galaxy in circular orbit in a spherical cluster are obtained by inserting the explicit time 
dependence of the three angles in the two cases, and in the first case Q = 0. The result- 
ing differential equation (3.54), with periodic time-dependent coefficients, can be easily 
integrated numerically (e.g., see Muccione and Ciotti 2004), and the simulations nicely 
confirm that the outskirts of galaxies can be significantly affected by the cooperative effects 
of librations and of the cluster tidal field, with stars easily doubling their apocenters and 
being finally lost in the cluster, suggesting that collisionless evaporation may play some 
role in the production of the intracluster stellar population. 

As can be easily imagined, the literature on the effects of tidal fields on the structure 
and dynamics of stellar systems is enormous; for additional readings discussing prob- 
lems of great astrophysical relevance, the student is invited to consult more specialized 


Exercises 65 


books, such as Binney and Tremaine (2008), Chandrasekhar (1942), Ogorodnikov (1965), 
and Spitzer (1987). 


3.1 


3.2 


3.3 


3.4 


Exercises 


Prove that Eq. (3.6) gives the tidal field produced by a spherically symmetric density 
profile p(r) and show that g(r) = 1 — y/3 for the power-law density profile p(r) « 
r—Y in Eq. (13.29) with y < 3. Finally, prove that g(r) < 1 for monotonically 
decreasing p(r). Hint: The inequality p(r) < p(r) can be proved from integration 
by parts of the integral expression of (r) in Eq. (3.7) and then using the assumption 
dp/dr < 0. 
King (1972) proposed the density distribution 
0 r 

p= a sah, G.57) 
in order to describe clusters of galaxies (see also Eq. (13.34)), where r is the distance 
from the cluster center, r. is the “core” radius, and the distribution is truncated at 
some 7. From Eq. (3.6), show that forr < 7 
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3(s2 + 1)3/2 (arcsinhs —s/Vs2+ 1) 


q(r) (3.58) 


where arcsinhx = In(x + Vx? + 1), so that g = 2/3 for s ~ 1.027. 
Show that in the spherical limit (aj = a2 = a3 = rc), for y models in Eq. (2.83), and 
for the perfect ellipsoid in Eq. (2.93), we have 


i—y/3 253 s \" 
= ; = —— tans — —=—— F 3.59 
ee q(r) 302 + 12 (ae ans 2a :) ( ) 


where s = r/rc, so that g = 2/3 fors = (1—y)/2 (0 < y < 1) and s~0.824, 
respectively. Do the exercise for all of the spherical models from Eqs. (13.29)- 
(13.43) and determine for each of them the critical radius so that g(r) = 2/3. 

By using Eqs. (2.72) and (3.2), show that the tidal tensor 7 at position x, produced 
by a triaxial ellipsoid with p = p(m), is given by 


xix; p' (mr) dt 
(a5 + t)mz | (a? +1)J/AC) 


where m; = m(x,t) and p’ = do/dm. Show that at the center (m; = 0) of a suf- 
ficiently regular density distribution or in the case of a constant-density ellipsoid, 
Eqs. (3.10) and (3.11) are recovered. Finally, prove that the Poisson identity (3.5) 
holds. Hint: To prove the Poisson identity, write the trace of Eq. (3.60) and express 
the second term of the resulting integrand by using the second identity in Eqs. (2.73). 
An integration by parts of the second integral finally establishes the result. 


(oe) 
Tij = 20 Gayaza3 / i004 (3.60) 
0 
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By using Eq. (3.11), prove that 

w, +w2+ w3 = 2. (3.61) 
Hint: Write the sum above adding the three corresponding integrals and compare the 
resulting integrand with the quantity dA(t)~!//dt. 
From direct integration of Eq. (3.11), prove that in the oblate axisymmetric case 
a| = a2 > a; (ie., for gy = 1), 


wp = w= 


V1—e2 (= ; *) 
e ’ 


e2 e 
(3.62) 
/1 — e2 1 arcsin e 
w3 = , 
° e? VJ 1 — e2 é 
while in the prolate axysimmetric case a, > az = 43 (1.e., for gy = gz), 
1 —e? /arctanhe 
w, = 2—, 1), 
er e 
(3.63) 
1—e? 1 arctanh e 
WwW = w32= ; 
: ° e2 1-—e2 e 


where in both cases we adopt the definition e = ,/1 — g? = sing as in Eq. (3.12), 


and we recall that for |x| < 1, arctanhx = In./(1 +x)/(1 — x). Show by direct 
evaluation that identity (3.61) is verified. 
Show that the inertia tensor of p, in Sp can be written in terms of R and J; as 


3 
Sn@= >. RieORadola (3.64) 
p=1 

and discuss the vanishing of Sj, for 7 # k as a function of the rotation angles. Prove 
by direct evaluation that the trace of 3 is invariant, Tr(S) = J; + bb + 4h. Hints: 
Change the coordinates in Eq. (2.89) to y = Rx’, where 7 is the orthogonal matrix 
in Eq. (3.20), and recall that in S’ the inertia tensor of p, is diagonal, with J; = a F 
(no sum over i intended). Also recall that the rows and columns of an orthogonal 
matrix can be seen as vectors of unitary norm. 
Prove Eq. (3.25). Hints: Substitute the first identity in Eq. (3.17) into the left-hand 
side of Eq. (3.25): the resulting integral splits into two integrals. The first is trivial. 
For the second integral, use the second identity in Eq. (2.90): the first integral is 
obtained from Eq. (3.5) and the second one is from the diagonality of 7 in So and 
Eq. (3.64). 
Show that the kinetic and potential energies of p, for the problem discussed in 
Section 3.2.1 can be written as 


2T = 1 (d sinw + ¢cos ¥ cos yy + Ih(d cos yw — pcos 8 sin)” 
+ bw + gsinvd)’, (3.65) 
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and 


2U = (cosg¢cos yw — sing sin? sin WY ATy Alyy + (sing cos BD)? AT Als} 
+ (singcos w + cos@ sin? sin W)°AT31 Alay + (cos g cos 0) AT3 Ala, 
(3.66) 


where all of the additive constants in Eq. (3.25) have been set to zero. Hints: The 
expression for the kinetic energy is obtained from Eq. (3.24). For the potential energy, 
focus on the last term in Eq. (3.25), and from the definitions A7;; = 7; — T; and 
Al;; = [; — 1; show that 


3 
ROTI, = D> REOAT ATA +1 +b + b) 
i, p=2 


+ (1 + Tr + T3) — 371, (3.67) 
simply by using the orthogonality of R. 
With this exercise we consider the limit for small cluster and galaxy flattenings of the 


galaxy libration frequencies in Eq. (3.29). First, show that for a triaxial cluster with 
a, > az = az (so that € < n < 1), from Egs. (3.13) and (3.31) 


82 Gp (0) 82 Ge(0) a 82 Gp (0) 


Ag (=<), - Ay — Ai) 
5 > 5 
and from Exercise 2.18, that 
~  2Mazh ” Ma;h .  2Mazh 
Su = d-e-7), 32 = (-—7), 333 = (1 —e), 
3 3 3 
(3.69) 
ath ath 2azh 
Q1= 3 (e+), Q2= 37 —2€), Q1= 3 (€ — 2n). (3.70) 
Finally, from Eqs. (3.30) and (3.32), show that at linear order in the galaxy flattenings 
l—v l-u 
~ Nx — Ex, ~ x, U-U~ Ex, (3.71) 
u v 


and finally obtain the leading term of the resulting Eq. (3.29). See also Exercise 10.20. 
Show that in Eq. (3.41) 


R'g= Q(sin g sin wy — cos ¢g sind cos w, sing cos f 
+cos@ sin? sin y, cos g cos 7). (3.72) 


With this exercise we construct the solution for assigned initial conditions (¢, Go) 
and (v0, 3) of the two first components of Eq. (3.42), which we rewrite as 
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First, show that equations above can be separated into two identical fourth-order 
ordinary differential equations, 


d*o d*p d*9 do 


with the two constants 
a= A,+B\B.+ A2, B= AjA?2, (3.75) 


given in Eq. (3.45). Then show that the four solutions of Eq. (3.44) are 


pa 2_ 4 
1.2.3.4 = 2W4, WL= — (3.76) 
so that the general solution for g is given by 
et) = Nie! 4 MeO" 4 eo Le, (3.77) 


where X; are complex numbers to be determined from the initial conditions by 
evaluating up to the third derivative of p(t) at t = 0: 

X, + X2 + X3 + X4 = Go, 

(X| — X2)@4 + (X3 — X4)@_ = —iGo, 

(X1 + X2)@% + (X3 + X4)w2 = Aigo — Bi Vo, 

(X1 — X2)@} + (X3 — X4)o? = —i (Aigo — Bido), 


(3.78) 


and a similar expression holds for 7 (¢). Finally, from the first and third identities in 
Eq. (3.78), show that X; + X2 and X3 + X4 are real, and from the second and fourth 
identities in Eq. (3.78), show identities in Eq. (3.78), show that X; — X2 and X3— X4 
are purely imaginary, so that X2 = X} and X4 = X3 and g(t) and #(¢) in the stable 
case can be written in terms of real trigonometric functions. 

With this exercise we explore (qualitatively) the important astrophysical concept of 
tidal disruption, and we estimate the size of the so-called Roche limit. Consider a 
self-gravitating spherical body (a star or a satellite) of mass m and radius R moving 
in the gravitational field of a larger mass M (a black hole or a planet). First, let the 
center of mass of the body m approaching M be on a radial orbit at distance r along 
the x-axis; use Eqs. (3.8) and (3.55) to determine the critical distance rroche where 
the tidal field along the radial direction balances the gravitational pull Gm/R? for a 
point at the equator of the falling system and show that 


2M 1/3 
TRoche = — R. (3.79) 


m 
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For r < rRoche, the object is presumably disrupted by the tidal forces due to M. 
Repeat the exercise for the object on a circular orbit of radius r around M, and again 
from Eqs. (3.8) and (3.54) show that now 


3M 1/3 
Roche = | —— R. (3.80) 


m 


Explain physically why the Roche limit in the case of a circular orbit is placed at 
a larger distance from M. Finally, recast the two equations obtained in terms of 
the averaged densities of the two bodies, using Eq. (3.7) for the body M. Hint: 
Use Eq. (3.54) with the center of mass of m on the x’-axis of the system S$’, with 
x’ = (R,0,0), and in the rotating case with wy = (0,0, Q). 


4 
The Two-Body Problem 


In this chapter, we introduce and solve (by means of the Laplace—Runge—Lenz vector) the 
two-body problem, with an emphasis on the properties of hyperbolic orbits, and specifically 
on the so-called slingshot effect. The obtained results will be used in Chapters 7 and 8 for 
the derivation of two fundamental timescales characterizing the dynamical evolution of 
stellar systems: the two-body relaxation time and dynamical friction time. 


4.1 Center of Mass and the Reduced Mass 


The two-body problem is, togheter with the harmonic oscillator, one of the most celebrated 
cases of a “solvable” problem of classical physics. Its importance in stellar dynamics is 
fundamental, as will be clear from (some of) its consequences worked out in later chapters. 
Here, we focus our attention on the main properties of the orbits: it is assumed that the 
student has already been exposed to the basic results in the field (the literature on the subject 
is immense; suggested readings of different levels of difficulties are, e.g., Binney and 
Tremaine 2008; Boccaletti and Pucacco 1996; Danby 1962; Goldstein et al. 2000; Landau 
and Lifshitz 1969; Roy 2005; Szebehely 1967). Before addressing the problem, it may be 
useful to remind the reader that in stellar dynamics — at variance with celestial mechanics — 
attention is focused more on unbound (hyperbolic) orbits than on elliptical orbits. 

As is well known, different formulations of the basic laws of mechanics allow for 
the use of different analytical methods for the solution of Newton’s differential equa- 
tions of motion. Thus, in addition to the vectorial form (the modern version of Newton’s 
severely geometric approach; e.g., see Arnold 1990; Chandrasekhar 1995), the student 
certainly knows the elegant and essentially analytical (coordinate-based) Euler-Lagrange 
formulation, and perhaps the deeper Hamiltonian approach, in which phase-space geometry 
and action-angle variables are the major ingredients (e.g., see Arnold 1978; Binney and 
Tremaine 2008). Due to the introductory nature of this book, in our discussion we will 
use the usual vector approach, but we will also exploit the full power of the conserved 
quantitites of the problem, avoiding the need to perform explicit integrations. 
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We start from the equations of motion for two point masses or stars (a test mass mt 
and a field mass mz) in a generic inertial reference system So mutually interacting with the 
gravitational force 


Gmyme Gnmyme 


Yr, mgx¢ = ah re lIr|l, (4.1) 
r 


where r = xX; — X¢ is the relative orbit of m;, with respect to mr and v = r = vy; — ve is the 
relative velocity. Of course, the equations must be supplemented by the initial conditions 
X10. Vio. Xf0, and veo. 

As is well known, by adding the two equations in (4.1), we can show that the conser- 
vation of the total momentum P = mv; + mev¢ is embodied in Newton’s law, and from 
double time integration 


mX, + mgX¢ = M (Romo + Vemt), M =m, + me, 
4.2 
mM Xto + MFXfO Vv ™MVi0 + MEVEO a) 


Remo = =i SO CM = a Ta 


where Rcmo is the position of the center of mass of the pair at t = 0 and Vcwy is the constant 
velocity of the center of mass Row = Rcmo + Vcmt. Moreover, by scalar multiplication of 
the first equation in (4.1) by x; and of the second equation in (4.1) by x¢ and by summing the 
two resulting identities, we obtain the expression of the total time derivative of a function 
that remains constant when evaluated along the orbits of the two masses: 


mallvel>  mellvell? _ Gmump (4.3) 
2 2 r 


In other words, we prove that the total energy E is conserved.! Finally, from summation of 
the cross product of the first equation in (4.1) by x; and of the second equation in (4.1) by 
Xf, it follows that the vectorial quantity 


m Xt A Vt + mexe A VE = J (4.4) 


is also conserved when evaluated along the orbits of the two masses: J is the total angular 
momentum referring to the origin of So. The constant quantities Remo, Vcm, £, and J are 
fixed by the initial conditions, and so their specific values depend on the specific reference 
system So adopted, but their existence is guaranteed in all of the inertial reference systems; 
in practice, in the two-body problem we have 10 quantities conserved during the orbital 
evolution (also known as the /0 classical integrals of motion; see also Chapter 6). It is a 
simple exercise (do it!) to show that all of the results obtained so far are not restricted to 
the 1/r? force law, but also hold in the case of a generic force law that can be derived 
from an interaction energy U(r), and thus automatically obeying Newton’s Third Law of 


! This approach is so relevant to the theory of ordinary differential equations that the analogous procedure used to reduce the 
order of differential equations is usually referred to in mathematics as the “energy method.” 
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Dynamics, with mx; = —Vx,U and mfx¢ = —Vx,U = Vx,U. Remarkably, as we will see 
in the next section, an additional quantity is conserved in the specific case of the central 
1/r? force field. 

The reference system So is nothing special (except for it being inertial), so that formally 
identical expressions will hold in any other inertial reference system S’. In particular, the 
coordinates of a point in the two reference systems (see Appendix A.1.1) are related by 
the Galilean transformation x = R + Rx’, with R(t) being the origin of S’ moving with 
constant velocity V = dR/dt with respect to the origin of So, and R being an orthogonal, 
time-independent 3 x 3 matrix fixing the constant orientation of S’ with respect to So. We 
highlight the important (but not always fully appreciated) fact that by changing inertial 
reference system, the values of the constant quantities Rcmo, Vcm, £, and J in general 
will change (i.e., these quantities are constants in different? inertial systems, but they are 
not invariants). Among all of the inertial reference systems, the most important are those 
moving with velocity V = Vcm with respect to So, and of these, the most important the 
subset (which we indicate with Scm) with the origin coincident with the center of mass 
(i.e., R = Rc). As is shown in Exercise 4.1, in this last family (when restricting without 
loss of generality to R = J; 1.e., when the axes of Sg and Scy are parallel), the following 
identities hold at all times: 


mX, + mex, = 0, mv, + mev; = 0, 


IIvi ll? Iv? Gmyme 
E=M———+4+E', EF’ = : , 
5) + mt > + me 5) ‘ 


J=MRcm0 A Vem + J’, J! = mex, A vy + eX; A Vp, 


(4.5) 


where E’ and J’ are the total energy and the total angular momentum of the pair in the 
barycentric reference systems Scm. 

We are now in a position to solve Eqs. (4.1), a process that will be completed in 
Section 4.3 after an interlude in Section 4.2 dedicated to the slingshot effect. The standard 
procedure to solve Eqs. (4.1) is to combine them in the differential equation for the relative 
orbit of m, with respect to mg: after division of the first equation by m;, and of the second 
by me, by evaluation of xX; — Xf one gets 


38 Gmmn mm 
r= sn i= : - M=m-+mey, (4.6) 
r M 
where jz is the reduced mass. 4 = m/2 for identical masses, m_ = mp = m, while 


in the case of a significant mass discrepancy, jz is just slightly less than the mass of 
the light component. Mathematically, Eq. (4.6) is identical to the equation of motion of 
a point of mass jz orbiting in a radial 1/r? field with a fixed center and a magnitude 


2 The student should be convinced that different values of conserved quantities do not simply represent a mathematical property 
without physical importance. Consider a free particle moving with constant velocity in some inertial system: its (kinetic) 
energy is conserved, and it can be used to do some work (e.g., by heating a target in an inelastic impact). When observed in a 
second inertial system moving with the particle velocity, its kinetic energy is again conserved, but its value is 0, and no work 
can be done in this system. 
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equal to that between the two particles. The student should realize that the physics behind 
this transformation is much less trivial than the resulting mathematical expression. First, 
Eq. (4.6) is invariant for all of the observers, as well as in accelerated motion, provided 
their reference system is not rotating (i.e., all of these observers would determine the same 
relative orbit (prove it!)). Second, despite Eq. (4.6) being formally identical to that of a 
gravitational problem in an inertial reference system, it is actually obtained by referring 
the coordinates to x¢(t) (i.e., to a noninertial reference system), and the student should ask 
them self (and answer) why the noninertial translational term in Eq. (A.29) seems to be 
missing on the right-hand side of Eq. (4.6). 

The full solution of Eq. (4.6) can be obtained by following standard procedures (e.g., see 
Goldstein et al. 2000; Landau and Lifshitz 1969), and we will work on this in Section 4.3. 
For the moment we assume that the relative orbit r(t) has been determined, and we focus 
on how to recover the orbits x; and xs in Sg from the knowledge of the relative orbit r. We 
exploit the unique properties of Scm: from the first equation in (4.5) and from the identities 
r =X — xf = xX, — x; and v =f, it follows that in Scm 


<= , nee Y= al ee al (4.7) 
Mt Me Mt Mf 
and so in So 
pr(t) LLV(t) 
X(t) = ee + Rem(t), v(t) = aa + Vom, 
t t 
4.8 
rit) yv(t) a 
xe(t) = —-—— + Rem (1), ve(t) = — + Vem 
me m 


It is apparent how the two barycentric orbits are simply scaled-down (homotetic) versions 
of the relative orbit r and how the reconstruction of the two “absolute” orbits in Sg from r 
has been achieved thanks to the fact that Row = 0 = P’ in Sqm. Finally, the student 
is invited to verify that the energy E; and the angular momentum J, of the relative orbit 
are conserved and that their values coincide, respectively, with those of E’ and J’ in Scm, 
introduced in Eq. (4.5). In formulae, 


i 
v G 
: un aa =E’, Jy=prav=J. (4.9) 


4.2 The Slingshot Effect 


Before moving on to solve the gravitational two-body problem, we consider a very general 
and important phenomenon associated with central forces that can be derived from an inter- 
action energy U(r) allowing for unbound motions (i.e., two-body interactions beginning 
and ending at an infinite relative separation of m, and me). The 1/r? force is of course 
the most important case, while examples of forces that do not share this property are the 
harmonic force (with U « r?) and the modified Newtonian dymanic (MOND) force (with 
a qualitative U « Inr behavior in the deep regime; see Exercise 2.34). The phenomenon 
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under consideration is known as the slighshot effect, and it is used (for example) to send 
probes to outer planets in the Solar System (or to slow down probes so that they can fall? 
toward the inner planets). But how does the slingshot effect work? After all, Eq. (4.9) tells 
us that energy is strictly conserved in the relative orbit and, perhaps making the effect even 
more mysterious, we will show that in Scm the moduli of the velocities of the two objects 
(e.g., the planet and the probe) at the beginning and at the end of their interaction are also 
unchanged: How can the probe then change its energy? 

The explanation is rooted in the fact that energy is conserved, but it is not invariant for 
a change of (inertial) reference system, and the energy balance between the probe and the 
planet in the reference system So (e.g., the system centered on the Sun) is different from 
that in the system Scm of the planet and the probe. Suppose that v;(—0o) is the initial 
velocity at f = —oo of the test mass m, and v¢(—oo) is the initial velocity of a field 
mass mg in a generic inertial reference system So. Moreover, let us assume that the initial 
separation of the two masses is infinite, so that from Eq. (4.5) the total relative energy of the 
pair coincides with the initial total kinetic energy. From Eq. (4.9), in the relative orbit the 
conservation of E, also reduces to the “conservation” for t = oo of the relative kinetic 
energy, i.e., 


I|[v(+00)|| = Ilv(—o0)||, v= vt — VP. (4.10) 


Our interaction can be imagined as an elastic collision, even if no real collision is occur- 
ring. It should be superfluous to stress that the “conservation” of the modulus of v at the 
beginning and at the end of the interaction does not imply that the direction of the relative 
velocity also remains unchanged (see Figure 4.1), and so in general 


Av = v(+0) — v(—oo) = Av, + Avy £0, (4.11) 


where Av, and Av, are the components of the change of relative velocity perpendicular 
and parallel to the initial relative velocity v(—oo), respectively. By definition, 


vi(—co) =0, vy (—00) = V(—00), 
(4.12) 
Av, =vi(+00), Avy = vj (+00) — v(—ov), 


so that from v(+00) = v(—oo) + Av and Eq. (4.10) we obtain the exact identity along the 
relative orbit 


Avil? + Avy ll? + 2(Avj,v(—00)) = 0, (4.13) 


with the necessary conclusion being that the inner product in (4.13) is negative (i.e., that 
Avj is necessarily antiparallel* to v(—oo)). It is useful to consider the positive and negative 


It may be of some surprise to the reader to realize that sending a probe to Mercury actually requires a lot of energy. This is 
because the orbital velocity of Earth around the Sun is much larger than Earth’s escape velocity, and the orbital velocity of 
Earth must be subtracted from that of the probe to allow its “fall” toward the Sun! 

By definition, Av) is parallel to v(—oo), so that in the limit case of Av, = 0, from Eq. (4.13) necessarily Avy = 0, or 
Av\ = —2v(—ov) (prove it!). 
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Figure 4.1 Schematic representation of an unbound two-body relative orbit in an attractive force field 
of potential energy U(r) allowing for escape. In the case of the 1/ r? force with U = —Gmyme/r (or 
in Rutherford’s electrostatic scattering experiment), the relative orbit is a hyperbola. For the meaning 
of the vectorial quantities, see Eqs. (4.10)—(4.12). 


terms in Eq. (4.13), respectively, as “heating” and “cooling” of the relative orbit due to the 
gravitational interaction, and in Chapters 7 and 8 we will elaborate on this point. 

Figure 4.1 illustrates qualitatively the geometry of the relative orbit in an unbound elastic 
interaction. In particular, the rotation angle Ag is usually obtained by direct integration of 
the equations of motion of the reduced mass in polar coordinates (see Exercise 4.7 and the 
next section for the 1/r? case) and the relative orbit rotation angle is given by 2Ag — a 
(prove it geometrically!); therefore, Ag = 2/2 corresponds to the limiting case of a straight 
(unperturbed) relative orbit, Ag = 37/4 to a rotation of the relative orbit of 2/2, and 
Ag = x to a complete inversion (rebounce) of the relative velocity. The student should 
prove the following identities involving the rotation angle,> 


lv (+00)|| = —|lv(—co)|| sin(2A¢), nies 
(vj (+00), v(—00)) = —||v(—00)||? cos(2AQ), 
and finally 
Avi ll = llvi(+oo)|], || Avy ll = 2Ilv(—00) || cos*(Ag). (4.15) 


5 Notice that sin(2Aq@) < 0 and cos(2Aq) > 0; see also Eq. (4.25). 
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We now move from the relative orbit to the orbit of the test mass in So: Eq. (4.8) shows 
that the exact identities 


Ay = AV, Avy = Avi, Avy as Ay (4.16) 
mt mt mt 


hold, and analogous relations also hold for the field mass mr (prove them!). We remark 
again that the two directions involved are not the directions perpendicular and parallel to 
the initial velocity v;(—oo), but to the relative initial velocity v(—oo). The change of the 
kinetic energy of the test mass in Sp associated with the slingshot effect is given by 


AE, = = x [IIve+00) I? = Ilvi(—00) 1. (4.17) 


Two extremely useful (and of course equivalent) expressions for A E; can be found, the first 
by inserting into Eq. (4.17) the initial and final values of vy; expressed in terms of v from 
Eq. (4.8) by using the fact that Vcm is constant during the interaction, and finally taking 
into account Eq. (4.10): 


AE, = (Av, Vom) = m(Avt, Vom), (4.18) 


where the last identity derives immediately from Eq. (4.16). In Eq. (4.27), we will construct 
a second alternative expression for AF; without explicit reference to Vcm, which we will 
use when considering the collective effects of stellar encounters in astronomical systems. 
A few important comments are in order. The first is that, obviously, the total energy 
of the pair is conserved, with AEs = —AE,. The second is that, as has already been 
stressed, the formulae in this section hold for generic forces allowing unbound orbits, and 
for illustrative purposes in Exercises 4.8 and 4.9 we will compute the deflection angle in 
the limit of very large angular momentum for the case of 1/r® forces. The third is that, 
according to Eq. (4.18), AE; depends on the rotation of the relative velocity vector with 
respect to Vcm, so that both energy gains and losses are possible in So, depending on 
the angle made by Av and Vcm. This property is routinely used for the orbital design of 
interplanetary missions. This last point is further illustrated by considering a (highly ideal- 
ized) application of the slingshot effect to planetary rings/protoplanetary disks. Following 
Figure 4.2, let us consider a body of mass M traveling from right to left in a system So with 
velocity vm: it could be a shepherd moon moving counterclockwise in a circular orbit in 
the plane of the rings of Saturn (placed at the origin of So) or a planet embryo rotating in a 
protoplanetary disk with the protostar at the origin of So. Clearly, we are only looking at 
a little piece of the orbit, so that its curvature is negligible. The object M interacts with a 
pebble of mass m < M (the focus of our attention), also rotating counterclockwise but on 
a slightly larger circular orbit, with velocity Vm; in case of a Keplerian rotation curve, M 
will therefore reach and surpass m, and near the ecounter the two velocities will be almost 
parallel, with a relative configuration depicted in Figure 4.1, where mp = M and m, = m. 
After the encounter, the body m is pushed down, and from Eq. (4.18) its energy is increased 
because, due to the enormous mass difference, Vom ~ vm(—0v), and so the angle between 
Vc and Av is less than 2/2 (see Figure 4.2). Notice that, according to Eqs. (4.30) and 
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Figure 4.2 Schematic representation of a shepherd moon (M) interacting with a small rock (m) ina 
planetary ring. The planet center, in the inertial reference system So, is at the bottom of the page, and 
both M and m are rotating counterclockwise. In the right part of the figure, the diagram shows the 
orientation of the initial and final relative velocities and the total change of the relative velocity; see 
the text for details. 


(4.31), this positive energy variation is associated with a final orbit of m around the center 
of So with a longer semimajor axis and a longer period; in practice, the gravitational “kick” 
increases the kinetic energy of m, but as a net result the average kinetic energy decreases 
(we will come back to this fundamental property of gravity in Chapter 6). The student is 
encouraged to repeat this discussion for the case of the light body m initially rotating on 
a circular orbit with a radius smaller than that of the orbit of M, so that in this case M 
is surpassed by m during the interaction. The result is that now the kinetic energy of m is 
decreased during the interaction and m is momentarily turned away from the central planet, 
but the final orbit is characterized by a shorter semimajor axis and higher average kinetic 
energy than before the interaction. 

If we now imagine millions of such encounters of M with little orbiting bodies, we real- 
ize that M will “clear” an annular region around its own orbit, pushing the pebbles further 
out on larger orbits, and sending the pebbles on smaller orbits toward the planet/protostar.® 
So what is the fate of M/? In fact, the energy gain/loss of the population of little masses m is 
associated with a corresponding energy loss/gain of M: a single encounter will do nothing 
to M, but when we sum over an enormous number of encounters, it is quite obvious that M 
must radially migrate inward or outward as a function of its total energy balance (negative 


6 In ring dynamics, much more complicated dynamical phenomena (e.g., resonances) are at play, producing the opening of 
“gaps” in the rings (e.g., see Esposito 2014; Goldreich and Tremaine 1982). 
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or positive), and this in turn will depend on the specific form of the disk rotational velocity 
curve and on the disk radial density profile. It is believed that radial migration was a very 
important phenomenon during the epoch of the formation of our Solar System. 


4.3 The Laplace-Runge-Lenz Vector 


As is well known, in the case of 1/r? forces, the two-body problem can be explicitly solved 
(or, using the old terminology, “reduced to quadratures”’; 1.e., it is possible to work out the 
solution in terms of successive integrations) in terms of simple functions. Here, instead 
of solving the problem with the usual approach based on explicit integration, we will 
follow a different path based on the full use of the conservation laws (whose mathematical 
expressions are sometimes known as first integrals; i.e., of functions of coordinates and 
velocities that remain constant when evaluated along orbits). Actually, the student should 
appreciate that the direct integration methods are essentially based on the existence of a 
sufficient number of global conservation laws: due to its relevance to stellar dynamics, 
this argument will be discussed again in Chapter 9. Here, we simply recall that when a 
dynamical system (a differential equation) admits a sufficient number of globally con- 
served quantitites, the system is said to be integrable in the sense of Lie—Jacobi. One of 
the most important and somewhat unexpected breakthroughs in mathematics and physics 
(with important consequences for classical and quantum mechanics) was the recognition, 
due to Poincaré (1854-1912) while working on the three-body problem, that dynamical 
systems may not have a sufficient number of such conservation laws (Poincaré 1892; 
see Barrow-Green 1997 for a very interesting and readable account). In practice, it was 
proved that some of the most significant problems that resisted solution did so not because 
the great mathematicians working on them were not sufficiently smart, but because these 
problems belong to the class of nonintegrable systems. Of course, nonintegrable systems 
can also have perfectly well-behaved solutions (in the sense of Cauchy; i.e., regular, unique, 
and continuously dependent on the initial conditions), but these solutions cannot in gen- 
eral be described by the intersection of global analytic varieties in phase space (obtained 
by fixing the values of the global conservation laws from initial conditions). The inter- 
ested reader can find very clear discussions of some of these fascinating problems which 
are of great importance for celestial mechanics and stellar dynamics, in several excellent 
books (e.g., see Arnold 1978; Bertin 2014; Binney and Tremaine 2008; Buchler et al. 
1988; Contopoulos et al. 1994; Evans 1990; Gutzwiller 1990; Lichtenberg and Lieber- 
man 1992; McCauley 1997; Whittaker 1917; Wintner 1947), where other issues (e.g., 
how additional conservation laws, in addition to the “natural” ones from Noether’s the- 
orem and evident symmetries of the potential, can be discovered with specific procedures) 
are also discussed. 

Here, due to the introductory nature of this book, we will only show that in the two-body 
problem with 1/r? forces we have enough conservation laws (in addition to the 10 classical 
ones) that the solution can be determined without resorting to explicit integration (see also 
Chapter 9 for further discussion). In fact, from Exercise 4.3 
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d : A 
r 2 Vv (r,v)r am Jr . (4.19) 
dtr r r pr 


The next step is to multiply the previous identity by —Gm,m¢ and then follow the surprising 
chain of identities: 


d/G G é a 
mmr Jr A mymnsr = Jr A uy = (v A Jr), (4.20) 
a i 3 dt 


LL dt 
where we used Eq. (4.6) for the relative orbit and the time independence of J;. We conclude 
that on the relative orbit, the vectorial quantity 


r r 


G 
Gmumd =v AJ, —- tr, = III (4.21) 
| i 


is conserved, where the dimensionless constant vector I is the so-called’ Laplace-Runge- 
Lenz vector. First, we note that l is placed in the orbital plane (i.e., in the plane perpendicular 
to J,). Second, we multiply the identity above by r and we call @ the angle between 1 
and r (the true anomaly), so that (lr) = r/ cosg, and from Exercise 4.4 we obtain the 
fundamental result 

Dp 


= —— —, (4.22) 
1+l/cos@ 


where 


_ il? fy, elle? 


= ae esol 4.23) 
Gmymepe G?m?m2 ue 


Equation (4.22) is the polar form of a conic, where p is the semilatus rectum, so that I is 
oriented toward the pericenter of the relative orbit and its length is the orbital eccentricity. 
From Eq. (4.23), it turns out that orbital classification depends on the sign of the barycentric 
energy: in particular, / > 1 for E, > 0 and the relative orbit is hyperbolic, while / = | for 
E, = 0 and the corresponding orbit is parabolic. For completeness, in Exercise 4.5 the 
basic properties of elliptic orbits (0 < / < 1, E, < 0) are finally determined. 

Now, as the total energy and the total angular momentum are conserved, they can be 
evaluated on the relative orbit at t = —oo (i.e., on the asymptote at the beginning of the 
impact), and from Figure 4.1 and Eq. (4.9) 


llv(—oo) ||? 

a 
where b is the so-called impact parameter. The impact parameter can then be seen geo- 
metrically as the minimum distance that the two bodies would reach on the relative orbit in 
the absence of interaction (see Figure 4.1), or dynamically, as a measure of the barycentric 
angular momentum of the pair, with large b corresponding to large angular momentum at 
a given asymptotic relative velocity. Again from Figure 4.1, it is obvious that the direction 


I|Jrll = Ld\|v(—co) |], Er = (4.24) 


7 The literature on the Laplace-Runge—Lenz vector and its possible generalizations is enormous; interesting introductory papers 
are Goldstein (1975) and Yoshida (1987); see also Exercise 5.21. 
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of the asymptote with respect to the direction of the pericenter (fixed by the direction of I) 
is obtained by finding the zero of the denominator in Eq. (4.22), i.e., 


b? \|v(—00) |I* 


cos(Ag) = -7 l= ji + 
where the expression for / is obtained for the hyperbolic orbit from Eqs. (4.23) and (4.24). 
As expected, when b — oo we obtain Ag = 27/2, while Ag = z for b — 0. Moreover, as 
we have seen, a rotation of the relative orbit of 7/2 corresponds to Ag = 37/4, and from 
Eq. (4.25) this translates (prove it!) to the special value bz /2 of the impact parameter, and 
finally to a particularly simple expression for / in terms of b and by 2: 


GM be 
Inj2 = V2, bejg = ———,, |= J1+—-. (4.26) 
Iv(—o0) | Be, 
Exercises 


4.1 Obtain the general transformation formulae for the total energy and linear and angular 
momentum of the two-body problem as seen from two inertial reference systems 
moving with relative velocity V = dR/dt. Specialize to the case V = Vcm, further 
restrict to the case R = Rem, and finally prove Eq. (4.5). Hint: See Appendix A.1.1 
for the general case. 

4.2 Show that the change in the kinetic energy of the test star m; in the reference system 
So, defined in Eq. (4.17), can be written as 


mt 2 
AE. = > [llAvl? +2(Av. vi(—00))] 
= AE, + AEy + mi(Avi,vi(—00)), (4.27) 


where AFy, = m;|| Avi, || /2 and AE ty = my|| Avey ||? /2 are the energy gains asso- 
ciated with Ay;, and Av, defined in Eq. (4.16). Prove the equivalence of Egs. (4.18) 
and (4.27). The first two terms in the last expression of Eq. (4.27) can be interpreted 
as “heating,” and in Chapter 8 we will prove that the last term is a “cooling.” Hint: 
Use the identity y,(+00) = v;(—oo) + Ay; in the definition of AF; and then use 


Eq. (4.16). 

4.3 Prove Eq. (4.19). Hint: After time differentiation, use Eq. (A.22) with b = v and 
a=c=r. 

4.4 Prove Eqs. (4.22) and (4.23). Hints: For Eq. (4.22), multiply Eq. (4.21) by r and use 
the identity (vA J,,r) = (r A v,Jr) = |lJrll?/. This also proves the expression 


of p in Eq. (4.23). In order to express the modulus /, consider the squared norm of 
Eq. (4.21) and the fact that ||v A Jr |? = ||v||7 Jel? from the orthogonality of v and 
J,. Obtain 
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2Gmm¢ 
GPmimgl? = G?mimg — — Wel? + IVP Sel? 
= Gt? 4 2 (aa = 
ly 2 r 
2\Jr ll E. 
= G?m2m? + nt a (4.28) 


4.5 The solution of the two-body problem was one of the greatest achievements of clas- 
sical physics, providing the theoretical explanation for the three Kepler (1610) Laws 
of planetary motions. By using Eq. (4.22), prove that in the case of negative relative 
energy, when the orbital eccentricity is 0 < / < 1 and the orbit is elliptic, the 
apocenter and the pericenter are given by 


Tmax = 1—P min = ter (4.29) 
so that the semimajor and semiminor axes are 
P Gmymne 7 Pp Jill 
(i ‘ b=a 1 P= = A (4.30) 
1-2 2E,| VI-2 J2Eri 
Finally, show that the period of the orbit is 
2unab — 2na?/? 
ie aco M Stam. (4.31) 


Jel SGM’ 


Hints: From the conservation of angular momentum, and using polar coordinates in 
the orbital plane, conclude that 


dy dA 
2 

= — =2u—, 4.32 
Sell = wr") = ue (4.32) 
where dA = r7(y)dg/2 is the area element as a function of y. Then integrate over 
the period P and recall that A = sab is the area of the ellipse. 


4.6 By using Eas. (4.14), (4.25), and (4.26), show that 


b/bz/2 2||v(—o0) || 


; Avy || = 
1+ 32/82, vay 1+B2/b2 


| Avi || = 2[lv(—oo)I (4.33) 
Obtain the leading term of the asymptotic expansion in the case of very large barycen- 
tric angular momentum (i.e., for b — 00). 

4.7. Consider the two-body problem with interaction energy U(r) allowing for unbound 
orbits. Write Eq. (4.9) in polar coordinates and show that the rotation angle Ag in 
Figure 4.1 can be written as 


— Udell f° dr i= I|JrlI* 
= ’ e = 
V2 Tata r24/ E, — Veg(r) yr 


Ag + U(r), (4.34) 
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where /min 1s the solution of the equation 
E, = Uegt(rmin)- (4.35) 


Then compute the integral for the 1/r? force and show that Eq. (4.25) is reob- 
tained once Eq. (4.24) is used. The student attempting this exercise will appreciate 
the elegance of the approach based on the Laplace—Runge—Lenz vector. Hints: For 
the derivation of Eq. (4.34), consult Landau and Lifshitz (1969) or Goldstein et al. 
(2000). In order to perform the integration, write EF, as in Eq. (4.35), change the 
variable to x = 1/r, and compute xmin = 1//min by solving a quadratic equation; 
trigonometric identities will finally conclude the proof. See also Goldstein et al. 
(2000) for a list of special values of a allowing for the integration of Eq. (4.34) 
in terms of special functions. 

Equation (4.34) cannot be solved in closed form for generic U(r), even restricting to 
the class of power-law 1|/r® forces in Eq. (1.13). However, it is possible to obtain Ag 
in the case of large angular momentum (i.e., for b — oo). Show that for 1 <a < 3 


and U(r) = —Gmm,r!~*/(a — 1) an asymptotic expansion® gives 
TA Gmyume la 
Ag ~ B{ =, ; 4.36 
° 5 aoe (75) ey 


where B(x, y) is the Euler beta function in Eq. (A.53). Moreover, from Eq. (4.15), 
prove that for distant encouters 


lava Gm,me¢ b B (; 5) (4.37) 
Vv , 5 F : 
one b* Iv(—ooy) \2"2 


Hints: First, show that the leading term of the solution 7in of Eq. (4.35) for b > co 
iS ‘min ~ D, independent of the value of a. Then expand the integrand in Eq. (4.34) 
for b — oo. The integration of the first term is trivial, but the second requires some 
care, and it can be performed in parts. 

With this exercise we prove the validity of a classical approach to the computation 


of the orbital deflection in case of a large impact parameter (e.g., see Binney and 
Tremaine 2008; see also chapter 4 in Landau and Lifshitz 1969). Consider again the 
force in Exercise 4.8, and assume that for large b the relative orbit remains straight 
and the relative velocity constant. Show that the time integral of the component of 
the force perpendicular to v(—oo) along the fictitious path is 


TOO dt 


pl Avail ~ Gmamd | (4.38) 


“et [b+ Il¥(—00)|/72| 


a+l’ 
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and that it evaluates to Eq. (4.37). Hint: Decompose the instantaneous force along the 
rectilinear orbit as g = g, + gj and recast the integral as a complete beta function in 
Eq. (A.53). 


8 The lower limit on w is needed to allow for unbound orbits and the upper limit is needed to exclude the possibility of a 
collision of the two particles, even for a nonzero angular momentum. 
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Quasi-Circular Orbits 


This chapter presents the basic properties of quasi-circular orbits in axisymmetric stellar 
systems. In fact, axisymmetric models are often sufficiently realistic descriptions — beyond 
the zeroth-order spherical case — of elliptical and disk galaxies. The associated potentials 
(with a reflection plane, the equatorial plane) admit circular orbits, and in this chapter we 
focus on the properties of orbits slightly departing from perfect circularity, describing in 
some detail the second-order epicyclic approximation. We also derive, in a geometrically 
rigorous way, the expression of the Oort constants, which are important kinematic quanti- 
ties related to the rotation curve of disk galaxies, and in particular to the orbit of the Sun 
around the center of the Milky Way. 


5.1 Orbits in Axisymmetric Potentials 


Stellar orbits in generic gravitational potentials can be very complicated, and explicit solu- 
tions of the Newton equations of motion cannot in general be found in a closed form. This 
may appear to be a drawback to the student, as the availability of explicit solutions cer- 
tainly seems desirable. However, even when available, a very complicated formula seldom 
leads to a deep understanding of the physics behind the problem, and any gains achieved 
from such complicated formulae may be illusory. At the same time, the work needed to 
understand a problem that cannot be solved in full analytical form quite often is more 
useful than the knowledge of the formal solution itself. As we will see, this is exactly the 
situation we face in this chapter (i.e., orbits in axisymmetric potentials); in fact, even if 
the hypothesis of perfect axisymmetry for the potential lead to great simplifications, for 
a generic #(R,z) in general it is not possible to obtain a closed-form expression for the 
orbits, except for very special cases such as circular orbits in the equatorial plane and axial 
orbits along the symmetry axis. Here, thanks to a very clever idea originating from the great 
mathematicians of ancient Greece, we will study the properties of orbits slightly departing 
from circular paths, by using epicyclic theory (e.g., see Gallavotti 2001; Schiaparelli 1926 
for presentations of remarkable clarity of how epicyclic theory translates in the language 
of modern physics, being a powerful mathematical tool). Of course, axisymmetric systems 
are of special interest in astrophysics, as many stellar systems are flattened and, at first 
approximation, axisymmetric. For example, spiral galaxies and SO galaxies are perhaps the 
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most prominent examples, but we should also remember accretion disks around black holes 
and planetary rings (e.g., those of Saturn). 

We begin by writing the equations of motion for a particle/star (of unit mass) in a generic 
axisymmetric systems whose potential is #(R,z) and (R, ¢g, z) are the standard cyclindrical 
coordinates. It is a useful exercise for the student to derive the equations from the Newton 
equations of dynamics in vector formulation, but here we follow the shorter Lagrangian 
approach. The Lagrangian function of our star is 

R2 32 ( R ¢)* 


L= ie eer $(R, 2), (5.1) 


so that the equations for the three generalized coordinates are obtained from the well- 
known Euler-Lagrange equation d(dL/0q;)/dt = 0£/0q;. In particular, the momentum 
conjugate to the coordinate g is conserved, and this momentum is none other that the z 
component! of the angular momentum of the star, i.e., 


Jz = RQ. (5.3) 
The equations for the two remaining coordinates can be written as 
‘ a) 0 0 
foo eo." (5.4) 
aR dz dz 
where the effective potential 
i 
be(R,z) = O(R,z) + FR? (5.5) 


has been introduced. Therefore, the motion of the star would be known if one first solves 
Eq. (5.4) in the meridional plane (R,z) and then solves Eq. (5.3) for the time evolution of 
the angle ¢~; by construction, ¢ is also the instantaneous angular velocity of the meridional 
plane. Notice that, independently of how complicated the time evolution of g is, g never 
changes sign, as J, is constant and R(t) > 0: no “inversion” in the sense of rotation of the 
angular coordinate can occur in axisymmetric potentials. 

An important point to be appreciated is that the reduction of the equations of motion 
from 7 to the meridional plane is done by fixing the value of J; initial conditions asso- 
ciated with different values of J, evolve into different effective potentials. We will refer to 
the set of orbits corresponding to a given J; as an orbital family. Of course, fixing J, by 
no means fully constrains the orbits, as an infinite number of different orbits (and different 
meridional planes!) exist for a given J. 

From Eq. (5.4), the energy (per unit mass) in the meridional plane of each orbit 

we. 2 


ve 
En = > + oy + @e(R,z) = E (5.6) 


| Notice that from Exercise 9.1 and Eq. (A.144) the angular momentum per unit mass in cylindrical coordinates reads 


J=xAv=—zvofp + (cup — Rvz)fg + Rugf, = Jrfp + Joty + Jzf- (5.2) 
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is conserved, and its value coincides with the value E of the energy of the orbit in physical 
space, as obtained from Eq. (5.1). It immediately follows that the motion is limited to the 
region of the meridional plane defined by 


E > @(R,Z), (5.7) 


where the equality identifies the zero-velocity curve (i.e., the locus where vr = v, = 0); of 
course, when a star hits the zero-velocity curve, its velocity is in general not zero (think of 
Earth’s velocity at the perihelion and at the aphelion!) because from Eq. (5.3) vg = J;/R. 
In order to develop our epicyclic theory, we identify the most important member of each 
orbital family (i.e., the circular orbit in the equatorial plane corresponding to the given J,). 
Circular orbits for a given value of J, are obtained by setting to zero the accelerations in 
the meridional plane from Eq. (5.4), and it is easy to show that for regular potentials with a 
reflection plane (i.e., @(R,z) = @(R, — z)) the radius Ro of the circular orbit(s) of angular 
momentum J, in the equatorial plane is given by the solution(s) of the equation 


d¢(R,0) 


2 3 
= ROR 


(5.8) 


obtained from the equilibrium points of Eq. (5.4). In particular, the rotation curve of the 
potential in the equatorial plane is given by 


a¢(R,0)  J2 
pee 
aR R2 


u2(R) = Q7(R) R? = (5.9) 
provided the radial derivative of the potential is positive. In the following, we will assume 
that this condition is satisfied (the usual condition!), but we recall that it is possible to 
construct nonspherical, nowhere-negative density distributions for which the positivity of 
2 is violated (e.g., see Exercise 5.7). Of course, in the special case of spherically symmetric 
mass distributions, from Newton’s second theorem the rotation curve at distance r from the 
center depends only on the mass M(r) contained in a sphere of radius 7, i.e., 


VU 


GM(r) 


ve(r) = > 0. (5.10) 
Equations (5.8) and (5.9) allow us to determine Ro for the assigned potential and J;. 
However, Ro of an orbital family can also be determined by assigning the orbital total 
energy in Eq. (5.6) as 


e(R Ro 3¢(Ro, 0 
Em = be(Ro,0) = =P + 6 (Ro,0) = PEO 


+ $(Ro, 0). (5.11) 
See also Exercise 5.9. 

We conclude this preliminary section by reminding ourselves of the fundamental impor- 
tance of the study of rotation curves of cold gas (in particular by using the 21-cm emission 
line of neutral hydrogen, HI) in spiral galaxies, one of the major diagnostics for the study of 
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dark matter in disk galaxies” (e.g., van Albada et al. 1985); we do not discuss this subject 
here, as it is treated extensively in the literature (e.g., see Bertin 2014; Bertin and Lin 1996; 
Binney and Tremaine 2008 and references therein). The exercises provide several cases of 
the explicit construction of rotation curves in the most used (and idealized) disk models; the 
student is encouraged to plot the resulting curves for different choices of parameters and 
to verify that quite flat rotation curves can be easily produced inside a disk by the field of 
the disk itself without need for dark matter. Only far outside the optical (stellar) disk does 
the flat rotation curve require a stronger field than that provided by luminous (baryonic) 
matter, which in those regions would be dominated by the monopole term of the disk (see 
Chapter 2), with the characteristic Keplerian falloff. 


5.2 Second-Order Epicyclic Approximation 


The idea behind the epicyclic approximation discussed here is to expand the effective 
potential of an orbital family up to the second order that is inclusive of small displacements 
around a circular orbit of radius Ro for fixed values of J,. We assume R(t) = Ro + r(t) 
with |r(t)|/Ro « 1 and |z(t)| being sufficiently small to guarantee the expansion, so that 


2 2 


Kri2, k& 
be(R,z) ~ he(Ro, 0) + Pd ae 2 


where the radial and vertical epicyclic frequencies are defined as 


a? a2 
«a = Pe ; Ke = Pe 
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z. (5.12) 


(5.13) 


2 2. 
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Note that in Eq. (5.12) the two linear terms in the displacements vanish identically on 
the circular orbit of radius Ro, that the mixed second-order derivative vanishes for the 
regularity and reflection properties of the potential on the plane z = 0, and finally that 
Kz is independent of J,. If we now insert Eq. (5.12) into Eq. (5.4), we see that at second 
order the equations of motion for the displacements are those of two independent harmonic 
oscillators 


#=—Kgr, Z=—K?z, (5.14) 


with oscillatory solutions, provided Ke > Oand K? > 0. Therefore, we have stable perturbed 
orbits for real values of the epicyclic frequencies and (linearly) unstable motions when at 
least one of the two frequencies is imaginary. In particular, it is easy to prove (do it!) that 


1 dJ2(R) _ 2Q(R) dQ(R) R* 
R32 dR |r-pn, dR |p_R, 


«2(R) = (5.15) 


2 Observationally, the circular velocity of HI in disk galaxies of large radii presents a characteristic flat shape with a remarkable 
proportionality between the total stellar light of the disk and (approximately) the fourth power of the circular velocity: the 
so-called Tully—Fisher (Tully and Fisher 1977) relation. 
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where J.(R) = Rvuc(R) is the angular momentum (per unit mass) of circular orbits in the 
equatorial plane as a function of R; of course, for a given orbital family, J.(Ro) = J;. From 
Eq. (5.15) follows the very important (and very useful in practical applications) Rayleigh 
criterion (i.e., stable circular orbits can exists only if J¢(R) is a monotonically increasing 
function of the radius). Such a result can be extended to fluid dynamics, leading to a stability 
criterion for equilibrium configurations of rotating fluids (e.g., Tassoul 1978). 

From Eqs. (5.7) and (5.12), it is now elementary to obtain the expression for the zero- 
velocity curves in our approximation, with 


2 


KR 2 a 2 
ad +32 < En — de(Ro,0). (5.16) 


For linearly unstable orbits the zero-velocity curves can be open, while for stable orbits the 
zero-velocity curves are ellipses in the meridional plane. Of course, for increasing values 
of Ey the shapes of the true zero-velocity curves as obtained from the full Eq. (5.7) will 
in general depart more and more for the ellipses of the second-order expansion (e.g., see 
Exercise 5.10). 

We now proceed with the discussion of epicyclic orbits, limiting ourselves to the case of 
stability. The solution of the linearized equations of motion in Eq. (5.14) is 


r(t)=rocos(krt), z(t) = zo sin(k-t), (5.17) 


where for simplicity we assume that at time t = 0 the orbit is on the equatorial plane and 
at maximum displacement from the center (apocenter). From the geometry of the problem, 
the resulting orbit can be expressed in vector notation as (prove it!) 


x(t) = Ro(t) + C()x’ = C(t) (Rofi +x’), (5.18) 


where x’(t) = [X(t), Y(t), z(t)] is the orbit as seen from the rotating reference system of 
the deferent (also known as the guiding center; see Figure 5.1), f; is the unitary vector 
aligned with the x-axis, Ro(t) is the position of the deferent, and finally C(r) is the rotation 
matrix in Eq. (3.34); with this equation, the student can draw the orbits in their preferred 
potential (e.g., see Figure 5.2). 

We now move to study the properties of x’(t) in some detail. From Eq. (5.17), the orbit 
will reach the pericenter after a time of 7/kp, return again to the apocenter after the radial 
period Tr = 27/kp, and so on; in the time Tr between two passages of the orbit from the 
apocenter, the circular orbit of radius Ro (the so-called deferent) rotates by the amount 


2nQ 
ea (5.19) 
KR 


As in general kr and x, depend on Ro, we can conclude that if for some special value of 
Ro the equality 

29 = Ke (5.20) 
holds for m and n integer numbers, then the orbit will close in the inertial reference system 
after m revolutions of Ro and n complete radial oscillations of the epicycle (i.e., n radial 
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Figure 5.1 Schematic representation of a deferent orbit of radius Rg (rotating counterclockwise) and 
of the associated epicyclic ellipse (rotating clockwise) in the equatorial plane of an axisymmetric 
potential. The instantaneous value of the angle }(t) = g(t) — Qot is given by Eq. (5.22) and the 
distance of the orbit from the center is Rg + r(f). 


“inversions” of the orbit; again see Figure 5.2). We will return to this important point at the 
end of this section, while in Exercise 5.20 we will use Eq. (5.19) to obtain the expression 
for orbital precession/recession angle, which is a very important property of quasi-circular 
orbits with consequences for classical mechanics and general relativity (e.g., see Landau 
and Lifshitz 1971). 

As was already well understood in antiquity, the appearance of the epicycle is of the 
utmost importance when observed from the guiding center. This is particularly important 
in the (X, Y) plane (i.e., the equatorial plane of the potential; see Figure 5.1), and we now 
solve this problem using the methods of analytic geometry. The first step is to obtain the 
differential equation for the instantaneous relative position of the meridional plane of the 
star with respect to the position of the meridional plane of the deferent. Expanding at the 


first order Eq. (5.3) in terms of the small number ro/Ro, one gets 
. 2Q20 
g = Qo —- —ri(t), (5.21) 
Ro 


so that the instantaneous angular distance between the guiding center and the star in the 
(X, Y) plane is given by 


Qo 


. 2 
910) = [ @— Sat = - 
0 KRR 


ro sin(kRt). (5.22) 
0 
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Figure 5.2 Two examples of rosette orbits x(t) in the equatorial plane of an axisymmetric potential 
built using epicyclic approximation and Eq. (5.18). The lengths are normalized to the deferent radius 
Ro and (79/Ro,KR/ Qo) = (1/10,3/2) for orbit a (heavy line) and (1/2,5/3) for orbit b. Both 
orbits start at positions 1, and after a first rotation of the deferent they reach positions 2. After 
a second rotation of the deferent orbit a closes, while orbit b closes after a third rotation of the 
deferent. Consistently with the adopted parameters, orbit a presents three apocenters/pericenters 
after two rotations of the deferent, while five are presented by orbit b after three rotations of 
the deferent. 


Now, an elementary geometric construction (see Figure 5.1) shows that the Cartesian coor- 
dinates of the epicyclic orbit as seen from the guiding center are 


X(t) = [Ro + r(t)] cos 0(t) — Ro ~ r(t) = ro cos(kpt), 


205 (5.23) 
Y() = [Ro +r()] sin B(t) ~ Rod) = ———rosin(ert), 
R 
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where in the second identity the first-order expansion is assumed. In practice, the epicycle 
is an ellipse in the reference system of the deferent given by 


Y(t) 7 


XC + =r. 5.24 
(t) 42 Jac To ( ) 


Therefore, the ratio of the guiding center frequency to the radial epicycle frequency not only 
determines whether the orbit is closed or open, but also the shape of the epicycle! Note also 
that the motion on the epicycle is opposite to the motion of the guiding center (another 
property that was well known to ancient astronomers), a consequence of the conservation 
of angular momentum: when the star is outside Ro its angular velocity is subtracted from 
Qo, and the opposite happens when the star is inside Ro. 

Two special cases of the utmost importance must be considered: the first is that of 
the Coulomb potential of a point mass, in which kr = 20; and the second is that of 
the harmonic oscillator (Hooke’s law inside a constant-density sphere), when kr = 22. 
Note that these identities hold independently of Ro (i.e., in both cases all of the orbits in 
the epicyclic approximation are closed). This is simply a manifestation of the so-called 
Bertrand’s theorem (i.e., the fact that the only potentials admitting only closed (bound) 
orbits are indeed the Coulomb and the harmonic oscillator cases; e.g., see Arnold 1978; 
Goldstein et al. 2000; Landau and Lifshitz 1969). It also follows that for realistic (.e., 
radially decreasing) spherical density distributions, it is expected that Qo < kr < 2Qo0 
moving from very concentrated to very shallow density profiles, such as in the case of the 
singular isothermal sphere kr = /2Qo (see Exercise 5.13). A few examples of the radial 
trend of Eq. (5.20) for an idealized family of disks are shown in Figure 5.3, where the 
special behavior of Q9 — kr/2 ~ 0 over a large range of radii can be seen: this behavior is 
much more than a simple curiosity, instead being a quite general property of quasi-circular 
orbits in disks. Actually, this property opens the way to the investigation of the difficult 
subject of spiral structure, which is well beyond the scope and the techniques presented in 
this introductory book (e.g., see Bertin 2014; Bertin and Lin 1996; Binney and Tremaine 
2008 and references therein). 


5.3 Oort’s Constants 


Another important use of circular orbits in astronomy is the deduction of the so-called 
Oort’s constants (e.g., Bertin 2014; Binney and Merrifield 1998; Binney and Tremaine 
2008; Chandrasekhar 1942; Ogorodnikov 1965). In this section, we will derive these con- 
stants geometrically under the idealized assumptions of perfectly circular orbits, all placed 
in the equatorial disk of a razor-thin galaxy, as illustrated in Figure 5.4. We indicate with 
Q(R) = Q(R)e, the angular velocity of the stars in the disk at distance R = ||R|| from 
the center, and we assume that the orbital plane is the (x, y) plane, so that e, is the unit 
vector directed along the z-axis and the velocity of circular orbits of radius R is given 
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Figure 5.3 Radial trend of the quantity Q9 — mk p/n for the Miyamoto—Nagai disk in Eqs. (13.62) 
and (13.63). Dotted and solid lines refer to values of the flattening parameter a/b = 1 and a/b = 5, 
respectively; higher values of the flattening parameter correspond to flatter models. In each of the 
two families, the ratios m/n from the upper to lower curves are —1/2, 0, 1/2, and 1. Note how the 


curve corresponding to 29 — kr/2 is quite flat over a large radial range. G = universal gravitational 
constant; M = total disk mass. 


by v = QAR. The Sun is located? at Ro = Roeo, where eo is the unit vector in the 
galactic plane pointing toward the galactic anticenter, so that Q(Ro) = Qe. It follows 
that the velocities of the Sun and of a generic star around the galactic center are given 
respectively by 


Yo =Qoe,ARo, Ve = 2(R)e, AR. (5.25) 


3 Note that in this chapter Rg is not the radius of the Sun, but the distance (assumed constant, ~8 kpc) of the Sun from the 
galactic center. 
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Two more unit vectors in the galactic plane, parallel and perpendicular to the vector | joining 
the Sun and the observed star, must be introduced, i.e., 


1=R—Ro, /= III, (5.26) 
with 
] 
e| = r’ ee; =e, Ae, (5.27) 
so that the relative velocity V of the star with respect to the Sun 
V=V,—-Vo = QAIF (Q—Q_) A Ro (5.28) 
can be decomposed into the parallel and perpendicular components as 
Vi, = (Ve), Vi = (V,e1). (5.29) 


We start by discussing the parallel component. From Eq. (5.28), the first term of the product 
of V with e; vanishes because 1 = / e), while the second product is easily evaluated from 
Eq. (A.24), so that (e, A eg, e) = (€@ A e],€-). From inspection of Figure 5.4, we then get 
the exact expression 


Along the same lines, we now derive the exact expression for the perpendicular velocity 
component 


Vi = (Vie, A ey) = (@;,e) A V). (5.31) 


In order to evaluate the two components of product ey A V (which is obviously parallel to 
e,!), we use the vector identity (A.22), taking into account that (e,,e) = 0. Simple algebra 
shows that 


Vi = Q14 (Q—Qo)Ro cos ¥. (5.32) 


Equations (5.30)-(5.32) are exact under the hypotheses of the problem (i.e., they hold 
independently of the distance between the observing point (the Sun) and each point in 
the disk); moreover, they can be immediately expressed in terms of the galactic longitude, 
which is the angle A = % + a measured counterclockwise from the galactic center (see 
Figure 5.4). 

We now restrict ourselves to places near the Sun (i.e., we consider stars with] << Ro), 
and we expand the angular velocity in Eqs. (5.30)—(5.32), retaining terms up to first order 
inclusive in /. From a Taylor expansion of the function Q(R), one gets 


2 (Ro +1ll) = Qo + 2'(Ro)(€o,1) + OW), (5.33) 
where Q! = dQ/dR and eg = Ro/Ro, so that (eg,1) = / cos @. Inserting Eq. (5.33) into 
Egs. (5.30)-(5.32), we see that in the Solar neighborhood 


Q/(Ro)R 
w= = WoiBe isin20 = Al sin29, (5.34) 
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Figure 5.4 Vector decomposition of relative velocities as observed from a point on a circular orbit in 
a infinitesimally thin disk made by perfectly circular orbits. 


Q'(Ro) Ro 
Vip = ]Qot+ + 5 lcos20 = B1+ Alcos2v, (5.35) 


eta ] 
2 
where A and B are two functions depending on Ro, called the first and second Oort’s 
constants. Notice that Eqs. (5.34) and (5.35), at variance with Eqs. (5.30)-(5.32), remain 
unchanged when the galactic longitude A is used instead of the angle ?. 

The Oort’s constants can be rewritten in terms of circular velocity instead of angular 
velocity because from Q(R) = v(R)/R 


dQ(R) , _ dv(R) v(R) 
dR ~ dR R? 


1/d 1(/d 
ha , Pee a (5.37) 
2\dR RR) pr, D\GR” RB) pope 


Therefore, if from observations we can know the values of the two constants and the 
distance of the Sun from the galactic center, we can also know the value of the orbital 


(5.36) 


and so 
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velocity of the Sun and its derivative with respect to the galactocentric distance “just” by 
measuring the relative velocity of the Sun with respect to bearby stars because 


A+B=(5) B A=(<) (5.38) 
NAR) per, AR] R=Ro 


Of course, measuring A and B from observations is a nontrivial task, which today is 
carried out using very refined observational and modeling techniques, that are well beyond 
the scope of this section. However, the logic of the procedure can be easily understood 
in its basic steps even in our elementary framework. First, one selects stars in the solar 
neighborhood at known distance / and different values of the galactic longitude A. From 
Doppler measurements, Vj is obtained from each star of the sample, and the constant A 
is adjusted so as to obtain the best fit of the data (Vj //,4) from Eq. (5.30). In order to 
measure the constant B from Eq. (5.32), proper motions are needed for the same sample 
of stars, so that V, can be measured. With the A constant fixed, B is obtained through a 
vertical shift of the plot of the points (V, //,2). Needless to say, the values obtained depend 
on the data set used, and their interpretation in terms of the rotation law of the galaxy is 
not as simple as it appears in Eq. (5.38), firstly because stellar orbits are more complicated 
than perfect and coplanar circles. In any case, observations reveal that (e.g., see Binney 
and Merrifield 1998; Binney and Tremaine 2008) in general A ~ 15 km s~! kpc! and 
B = —12kms7! kpc~!, with examples of very recent estimates being A ~ 15.3 km s~! 
kpc—! and B ~ —11.9kms7! kpe7! (Bovy 2017). In practice, from Eq. (5.38) it follows, 
among other things, that v < 0 (i.e., the Milky Way is rotating clockwise in a reference 
system with the z-axis pointing toward the Galactic North Pole (and so the total angular 
momentum vector of our galaxy points toward the Galactic South Pole)). 

This fact is at the origin of a long and established (and sometimes confusing) tradition 
of rewriting A and B in a way that explicitly takes into account the clockwise rotation of 
the Milky Way. Call Qyww and umw = Qmw R the modulus of the angular and circular 
velocity of the Milky Way, so that by fixing Q = —Qmw and v = —vumy in Eq. (5.37) we 
obtain the expressions of A and B that are almost universally encountered in the literature 


1 d 1 d 
Ae (“ a) B= (2s if a) (5.39) 


Exercises 


5.1 The rotation curve vc(R) in a given axisymmetric potential @(R,z) = o(R, — z) 
is the circular velocity as a function of radius R of orbits in the equatorial plane of 
the system given by Eq. (5.9). Show that for a stellar system made of different mass 
components (e.g., a bulge, a central supermassive black hole, a stellar disk, a gaseous 
disk, and a dark matter halo), the total circular velocity squared is the sum of the 
squared circular velocities of the components. By using the results of the following 
exercises and Eq. (5.10), the student is invited to experiment and construct the circular 
velocities for multicomponent galaxy models. 


5.2 


5.3 


5.4 
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From Eqs. (2.104) and (5.9), show that in spherical coordinates the circular velocity 
in the equatorial plane of an axisymmetric, razor-thin disk of surface density U(r) 
can be written as 


v2(r) = 2nG v1 P2/(0)] aE ae [ Eee de = 2 i. | 


al 
1=0 § 


(5.40) 


Isolate the 7 = 0 term and show that it can be written as GM(r)/r, where M(r) is 
the mass of the disk contained inside a circle of radius r; comment on the origin of 
the additional terms with respect to v2 (r) of spherical systems in Eq. (5.10). Repeat 
the exercise with &(R) in cylindrical coordinates, and from Eq. (2.105) prove that 


v2(R) = 276 | SdH kR)k dK = 4G | DEVE, R)EdE, (5.41) 
0 0 


where 
K(R E(R 
ei8) Huy hee 
V(éE,R) = 5.42 
(§, R) RECE/R) : (5.42) 
Rage NPE 


Hints: Read again Exercise 1.9. By using Eq. (A.78), compare Eq. (5.40) with 
eq. (2.263) in Binney and Tremaine (2008). The first identity in Eq. (5.41) is 
established from the first identity in Eq. (2.106) following Eq. (5.9) and using 
Eq. (A.87) and the second identity is established by evaluating the radial derivative 
of the last integral in Eq. (2.106) by using Eq. (A.66) or by the radial derivative of 
Eq. (2.107). 

Show by direct differentiation of Eq. (2.113) that the circular velocity of the razor- 
thin exponential disk in Eq. (2.110) with R=R / Raq can be written as 


2h) =a Gsnme yf R 1(®)\x,(% 5.43 
ve(R) = wGXoRag o(5 ) Kol; 1\5 is]: (5.43) 


Repeat the exercise with spherical coordinates by using Eqs. (5.40) and (A.51). We 
recall here the relevant reference of Casertano (1983), where the rotation curve of 
the finite-thickness and truncated exponential disk (with a soft cutoff; see also the 
following exercises) has been presented. 

By using Eq. (5.40), show that the circular velocity in the equatorial plane of the 
(truncated) Mestel (1963) disk in Eq. (2.115) can be written as 


Ri; ie) : R 21+1 
— P2/(0 — : > Rt, 
+ Dew or (=) r>R 


ve(r) = 2nGLaRa x - (5.44) 


21 
1+ Siew (£ -) , O<r=R, 
f=1 
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in agreement with the expression obtained by differentiation of Eq. (2.116). First, 
from the divergence of the series for r = R; (prove it!), conclude that the potential 
there is continuous with a vertical inflection point and uc(R,) = oo. Then, from the 
general result in Exercise 5.2, show that the constant term of the series, for points with 
r < Rt, is v2(0) = 27GDaRa = GM(r)/r, and that v2(r) = v2(0) = GM(r)/r at 
all radii in the untruncated case (Ri —> 00), in formal* analogy with the case of the 
singular isothermal sphere in Eq. (2.63). 

This exercise again focuses on Mestel (truncated) disk. From differentiation of 
Eq. (2.117), show that with cylindrical coordinates 


5 5 e 
=K{=), R>6, 
R R 


u2(R) = 4GLaRa x (5.45) 


where 6 = R,/ Ra and R=R / Ra, and by using Eq. (2.118), verify the equivalence of 
Eqs. (5.44). Consider the asymptotic limits for R — oo at fixed 6 and for 5 > oo at 
fixed R, and prove again that for the untruncated disk the circular velocity is constant. 
Finally (as a more difficult exercise), show that Eq. (5.45) can also be obtained by 
performing the integration in Eq. (5.41) using the second identity in Eq. (5.42). Hints: 
In order to prove Eq. (5.45) with direct differentiation, rewrite the last integral in 
Eq. (2.117) for points inside and outside R, and change the integration variable to 
t = &/R before differentiating. For the evaluation of the asymptotic limits, recall 
that K(O) = 2/2; notice that K(k) diverges for k — 1, confirming again that vu, 
diverges on the rim of the truncated Mestel disk. Notice that in the untruncated case 
from eq. (6.511.1) of Gradshteyn et al. (2007) 


. Lak 
S(k) = 1 co (5.46) 


and so the first integral in Eq. (5.41) and eq. (6.511.1) of Gradshteyn et al. (2007) 
prove that v2(R) = 21GXaRq. For the last question, the integration for points at 
R > R, presents no difficulties by using eq. (5.112.11) of Gradshteyn et al. (2007) or 
evaluating d[kK(k)]/dk with the aid of the first identity of Eq. (A.66). For points at 
R inside the disk, split the integration region as (0, R — €) and (R + €, R;), evaluate 
the inner integral as before and the outer integral from the first identity of Eq. (A.66), 
and show that the logarithmic singularities cancel for € — 0. 

We finally consider three truncated razor-thin disks in order to elucidate the effect 
of truncation on the rotation curve at the disk edge. For each of these models, we 
only consider the equatorial plane and we use cylindrical coordinates; the student is 


4 The student should be careful to avoid the conclusion that the mass distribution of the untruncated Mestel disk at r > rg does 
not affect ve at r < rg just because ve(r) = GM(r)/Tr. If this were true (as it is for spherical systems!), then ve (r) of the 
truncated Mestel disk, as obtained by removing the outer parts of the untruncated disk, would be constant (as happens inside 
the truncated isothermal sphere), which would be at variance with the second identity of Eq. (5.44). 
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invited to integrate Eqs. (2.104) and (2.105) and to prove the following results. First, 
for a truncated constant-density disk (Mestel 1963) of total mass Mg = 2 DoR? 


x(R) = XA —R), R=R/R, (5.47) 


: K(k) —E(R), 0<R <1, 
u2(R) =4GXoR, x 4 | as (5.48) 
RK(1/R)— RE(/R), R>1, 


where K and E are the complete elliptic integrals of the first and second kind in 
Eq. (A.65). Second, for the Maclaurin disk (Binney and Tremaine 2008; Kalnajs 
1972; Mestel 1963; Schulz 2009) of total mass Mg = 27 DoR?/3 


x(R) = Xov1— R20(1—R), R=R/R, (5.49) 
a R2 . 
; = §<R <1; 
vo(R) = wGXoR, x 2 (5.50) 


Rarcsin(1/R) — VR2—1, R>1. 


Third, for the finite Mestel disk (not to be confused with the truncated Mestel disk; 
Lynden-Bell and Pineault 1978; Mestel 1963; Schulz 2012) of total mass Mg = 
2m DR? 


arccos R ~ ~ 
X(R) = Do a(1—R), R=R/R, (5.51) 
aT « 
=~ O<R<1, 
u2(R) = 2nGXoR, x 4 2 (5:42) 


arcsin(1/R), R> il, 


with a constant circular velocity inside the disk. Notice that ve(R) in Eq. (5.48) jumps 
to infinity at the disk edge, as in the truncated Mestel disk in Eq. (5.45). This is 
a general feature of abruptly truncated razor-thin disks, as is discussed in Mestel 
(1963), and it can be easily proved from the last integral in Eq. (5.41) expanding the 
second integral of Eq. (5.42) near the disk truncation radius R;. The other two disks 
present soft truncations. Hints: We use Gradshteyn et al. (2007). For the potential of 
the truncated constant-density disk, from eq. (6.561.5) recover 


A Jia 
H(k) = TR? it ). AER (5.53) 


then use eq. (6.574.1); transform the hypergeometric functions into complete ellip- 
tic integrals with the aid of Gauss’ recursion formulae (9.137) and eqs. (8.113.1) 
and (8.114.1). For ue(R), use eq. (5.512.1) and convert the expressions again in 
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terms of complete elliptic integrals. For the potential of the Maclaurin disk, from 


eqs. (6.567.1) and (8.464.3) recover 


A m S372(A) sind — AcosA 
S(k) = DoR? i ap = 2p R} — 3 — A=KRt (5.54) 


then use eq. (6.574.1), and finally reduce the hypergeometric functions to elementary 
functions. For vc(R), use again eq. (6.574.1) and convert the expressions in terms of 


elementary functions. For the potential of the finite Mestel disk, first prove that? 


Si(A) 
ey 


S(k) == ; 


A=kR, (5.55) 


where Si is the sine integral function in Eq. (A.61). The circular velocity can be 
obtained from eq. (2.12.48.2) of volume 2 of Prudnikov et al. (1990), or with inte- 
gration by parts using Eq. (A.87) with Si(oo) = 2/2, Si(0) = 0, Jo(oo) = 0, and 


Jo(O) = 1, and then eq. (6.693.6) from Gradshteyn et al. (2007). 


5.7 Calculate the circular velocity for the family of power-law tori in Eqs. (2.87) and 


(2.88) and show that 


2 
>) 2 a- —9a+ 16 ~ Ay 
R) = —47G R : 
VAR) = AE CMe ar aS =a =D) 


(5.56) 


Discuss the result for 2 < a < 5 and obtain the constant value of the circular velocity 
for a = 4. What happens for 2 < a < 9/2 — V17/2 ~ 2.44? Hint: Consider the 
sign of the radial component of the force in the equatorial plane; see also Ciotti and 


Bertin (2005). 


5.8 From Eq. (5.4), it follows immediately that circular orbits in a given orbital family 
are critical points for the effective potential. From Eq. (5.6), show that circular orbits 
are also critical points for the energy Em(R,Z, R,z) in the meridional plane. Finally, 


prove that E,, is a minimum in cases of stable circular orbits. Hint: Use Eq. (5.16). 


5.9 Show that the circular orbits in the equatorial (z = 0) plane of an axisymmetric 
potential #(R, z) are fully determined by assigning the value of one of the quantities 
Ro, Uc(Ro), Jz, or Em. By using Eqs. (5.8), (5.9), and (5.11), and for each possible 
choice among the four quantities above, show how to determine the values of the 


remaining three. 


5.10 Draw the zero-velocity curves for the potential of a point mass as a function of orbital 
energy and for a given value of J,, and discuss their shape compared to that obtained 


from epicyclic expansion in Eq. (5.16). What happens for E = 0 and for E > 0? 


5 Equation (5.55) can be proved by considering the integral / (A) = Ie Jg(At) arccos t dt, with 1(0) = 1. Differentiate with 


respect to A, use dJg(At)/ddA = —tJ, (At) = (t/A)dJ(At)/dt, and integrate by parts. From eq. (6.554.2) from Gradshteyn et al. 


(2007) obtain /’ = —//A + (sin n/n and solve the resulting ordinary differential equation with the given initial condition, 
obtaining J = Si(A)/A. 


D1 
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Show that in a spherically symmetric stellar system the vertical epicyclic frequency 
is related to the potential (7) by the identity 


ore ldo) 


_o2 
: a (5.57) 


Explain physically the identity above. Hint: In spherical systems the orbital angular 
momentum J of each star is conserved, and the orbits are forced to remain in the 
invariable plane perpendicular to J and so are necessarily planar (see Chapter 6). 
Calculate the epicyclic radial frequency around a point mass with the Yukawa 
potential 
Ae 
og=- , pw=d, (5.58) 


P = 


and discuss the stability of circular orbits as a function of jz and r. 


Calculate the epicyclic radial frequency in the potentials 
r 
g=-—, G=Al—, (5.59) 


and discuss the stability of circular orbits as a function of a. For the stable cases, 
calculate the axial ratio of the epicycles and the precession/recession angle Ag. What 
happens for a = 2 (i.e., for the 1/r? force) and in the logarithmic case (correspond- 
ing to the singular isothermal sphere in Exercise 2.1)? Finally, plot the two Oort’s 
constants A(r) and B(r) as a function of r. 
A phenomenological pseudo-Newtonian potential, designed to describe accurately 
the gravitational field near a (Schwarzschild) black hole of mass Mgu, has been 
proposed in an influential paper by Paczynsky and Wiita (1980): 

6a ga eK (5.60) 


r—rs’ Cc 


where rs is the Schwarzschild radius and c is the speed of light. Calculate the 
epicyclic radial frequency and show that circular orbits around a nonrotating black 
hole are unstable inside the so-called innermost stable circular orbit (i.e., for 
r = "sco = 3rs). 

By using the Poisson equation with cyclindrical coordinates, show that in any regular 
axisymmetric system with @(R,z) = @(R, — z) the following identity holds at all 
radii in the equatorial plane: 


kp +k> = 4Gp(R,0) + 227(R). (5.61) 


Conclude that (1) a and K? cannot both be negative; and (2) from Exercise 5.11, that 


in spherical systems a cannot be negative. What happens outside the system (i.e., 


where p(R,0) = 0) and outside a spherical system? Explain the results physically. 
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5.16 With this exercise we determine the circular velocity and the epicyclic frequencies 
in the equatorial plane for the axisymmetric case of Eq. (2.28), with aj = az and 
a3 = qd, so that p = pop(m)/q;. First, by using Eqs. (2.21) and (5.9), show that 


R dt 


ay+t (af +1)?,/az +1 


Next, consider for simplicity the spherical case and prove that v2 (r) ~ GM/r for 
r — oo. Then, with the help of Eq. (2.73), compute the vertical epicyclic frequency 


Co 
u2(R) = 20 GR*a}qz / p (5.62) 
0 


R dt 


Jani) Geo 


Finally, obtain the radial epicyclic frequency from the sum rule in Eq. (5.61). As a 
more challenging exercise, compute Ke from the Rayleigh formula in Eq. (5.15) by 
using Eq. (5.62), and verify the sum rule. 

5.17 By using the results in Exercise 5.16, show that the circular velocity in the equatorial 


[oe] 
k2(R) = 21 Gajq, [ p (5.63) 


plane of the axisymmetric (oblate) ellipsoidal power-law model 


R2 2 
p=) Wa 4+, (5.64) 
qzmY ay 4zay 
is given by 
y-3 (3—-y 1 ie eee 
W2(R) = 2a Goats — 92)°='B ( sil 2) ( ) (5.65) 
a Le 


where B(a,b; x) is the incomplete Euler beta function in Eq. (A.55) and for conver- 
gence y < 3. Compute the frequency of the deferent Qo and the radial epicyclic 
frequency as a function of gq, and y, and repeat the exercise for the prolate case. 
Observe that for y = 2, corresponding to the ellipsoidal generalization of the singular 
isothermal sphere in Eq. (2.63), ve is independent® of R and the beta function reduces 


to 2 arcsin ,/1 — q?; evaluate the limit for g, — 1 and show that v2 = 47 Gpoa;, in 
accordance with Eq. (2.63). 

5.18 For an axisymmetric, homeoidally expanded system with the potential given in Eq. 
(2.31), obtain the general formulae for uc(R), kr(R), and «,(R). Repeat the exercise 
by using the potential of an axisymmetric system obtained from elliptic integrals as 
in Eq. (2.120), and finally for the axisymmetric case of the ellipsoidal potential in 
Eq. (2.78). 


6 Notice that at this stage we know of four different density distributions with perfectly flat rotation curves over the whole space: 
the singular isothermal sphere, the y = 2 power-law ellipsoid in this exercise, the a = 4 torus in Exercise 5.7, and the 
untruncated Mestel disk. It is easy to show that the first three systems, when projected face on (i.e., along the symmetry z-axis; 
see Chapters 11 and 13 for the concept of projection), all reduce to the untruncated Mestel disk. 
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Exercises 101 


From Eq. (5.15), show that the epicyclic radial frequency at large distances from an 
axisymmetric ellipsoidal body, up to the quadrupole order, is given by 


GM GM (a; _ ay 


) 
h 5.66 
R3 2h? es 


2 
KR= 


(i.e., the radial frequency decreases (increases) for oblate (prolate) distributions). 
Hint: From Eq. (2.94), first show that the angular velocity of the deferent is given by 
_ GM GM(a; — a3 


) 
Q2 = an h, 5.67 
0 Re oe oe 


so that circular obits in the equatorial plane of oblate (prolate) distributions rotate 
faster (slower) than in spherical systems of the same total mass. 

With this exercise we determine the formula of orbital precession angle (A@)pert in 
the case of an axisymmetric perturbation ¢; of the axisymmetric potential ¢. Let 
dpert = @ + €¢1, where € is some small ordering parameter. First prove the obvious 
identities et (R) = 27(R) + €Q7(R) and KR pert R) = KR(R) + ekp ,(R), where 
Q(R) and kr(R) are the circular and radial epicyclic frequencies of the unperturbed 
potential at radius R. Second, from Eq. (5.19), show that 


2 ke (Rk 
Qz(R) ky a (5.68) 


dQ = (A@)pert — (AP)e=0 ~ €7 ES 2(R) 


so that (A@)pert is known once (Ag)¢=0 and d¢ are known. Finally, apply Eq. (5.68) 
to the case of the ellipsoidal quadrupole perturbation in Exercise 5.19 and show that 


bp at a ae 
2x = 2R? 
(see also Landau and Lifshitz 1969, chapter 3). Notice that from Eq. (5.68) the effects 


of the opposite sign of the perturbation in Eqs. (5.66) and (5.67) add up, so that in the 
oblate case the orbit precesses, while in the prolate case it recedes. 


h, (5.69) 


With this exercise we prove an obvious (but nontrivial) property of epicyclic expan- 
sion. We begin by recalling that angular momentum J, (per unit mass) is kept fixed 
in the second-order expansion (5.12) in terms of the small quantities (79/ Ro, Zo/ Ro). 
Suppose one is able to solve the true equations of motion (5.4) for assigned rg and 
zo: then, when computing the z component of J = x A v from the obtained solution, 
we would recover J,, and this identity must hold at all orders in the expansion of the 
solution. In practice, from order balance, it follows that all terms of the series, except 
the zeroth order (i.e., the circular orbit of the deferent), must be separately identically 
zero. Verify by direct computation of the second-order epicyclic orbit from Eq. (5.18) 
that in fact 


(KA V)z = Jz = R6Q0 (5.70) 
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5.22 


Quasi-Circular Orbits 


of the deferent orbit. Hint: Use the vector expression in Eq. (5.18) and its time 
derivative from Eq. (A.28). 

In this exercise we consider a plausible generalization of the Laplace-Runge—Lenz 
vector in Eq. (4.21). First, for a mass m in a generic potential energy U (x), define 


L=U(x)x+vAJ, mx=-—VU, (5.71) 

and show that 
2E\|JI" 
m 


LI? = U7 (x) (Ix? + Ga) 


Then show that in the two-body problem (when in the previous identities x and v are 
the relative separation and velocity of the pair, m — wu, J > J; and E — E;,; see 
Chapter 4) we recover L = Gmmel. Second, show that in a spherical system with 


|x|| =r and U(r), 
dL dU 
— = (uv 4 2) vy. (5.73) 


Third, show that for orbits in the equatorial plane (x, y) of axisymmetric systems with 


U(R,z), R= fx? 4+ y?, and J = Jef, 


L = (Jzvy + UR)f — Jvpfy = R 


| fe, (5.74) 


OR 


where the last identity holds for circular orbits; deduce that for these latter cases L(t) 
is aligned with the star position, with the exception of the 1/r potential, when L = 0. 
Compute ||L||, verify the equivalence with Eq. (5.72) evaluated for equatorial orbits, 
and conclude that for these orbits ||L|| can depend on time only through R(t). Finally, 
discuss the time dependence of L for equatorial orbits in an epicyclic approximation 
(see also Lynden-Bell 2006). 

The radial epicyclic frequency is related to the Oort’s constants by a simple relation. 
Prove that 


«2(R) = 4Q(R) B(R) = 427(R) + 4Q(R) A(R). (5.75) 


Hint: Use Egs. (5.15) and (5.37) defined at a generic radius R in the galactic plane 
and the identity B(R) — A(R) = Q(R). 


Part II 


Systems of Particles 


6 
The N-Body Problem and the Virial Theorem 


The N-body problem, the study of the motion of N point masses (e.g., stars) under the 
mutual influence of their gravitational fields, is one of the central problems of classical 
physics, and the literature on the subject is immense, starting with Newton’s Principia 
(e.g., see Chandrasekhar 1995). Conceptually, it is more the natural subject of celestial 
mechanics than of stellar dynamics; however, experience suggests that at least some space 
should be devoted to an overview of the exact results of the N-body problem in a book like 
this. In fact, due to the very large number of stars in stellar systems, stellar dynamics must 
rely on specific techniques and assumptions, and one may legitimately ask which of the 
obtained results hold true in the generic N-body problem; for example, these exact results 
represent invaluable tests for validating numerical simulations of stellar systems. The virial 
theorem is one such result, and in this chapter we present a first derivation of it, while an 
alternative derivation in the framework of stellar dynamics will be discussed in Chapter 10. 


6.1 The N-Body Problem and the Lagrange—Jacobi Identity 


As is well known, in a generic inertial reference frame So, the differential equations! of the 
N-body (N > 2) problem read 


N 
%=-G YD mit, ny =i xj, G=1,....N. (6.1) 
j=l: jAi ij 

These equations are generally not solvable in the informal meaning of the word (i.e., they 
cannot be reduced to 6N — | independent integrations, being the right-hand side member 
explicitly independent of time; e.g., see Whittaker 1917; Wintner 1947), and so we must 
resort to numerical methods to obtain their solutions (e.g., see Binney and Tremaine 2008; 
Heggie and Hut 2003). Nevertheless, many important properties of N-body systems can be 
explored analytically by direct use of Eq. (6.1): obviously, the two-body problem discussed 
in Chapter 4 is the most important special case. 


! With i. igi we mean “sum over j from | to N, excluding j = 7.” With jal we mean “sum over i and j from | to N, 


excluding i = j.” And finally with Ly j=! we mean “sum over i and j from | to N.” 
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Let us start by introducing some standard notation. We indicate with rmin = minj¢; rij 
and rmax = max r;; the minimum and the maximum interparticle distances, and with M = 
yu ; mj; and 4 = minm; the total mass of the system and the minimum particle mass, 
respectively. The kinetic and potential energy of the system and its polar moment of inertia” 
are given, respectively, by 


[te Ge mim al 
= 5 domillvil?, ace d 7 1= Yi milli’. (6.2) 


i=l i=l 


Note that in the expression for U the factor 1/2 takes care of the fact that with the sum we 
are counting each pair twice. 

As is well known from classical mechanics (see Exercise 6.1), in any inertial frame So 
the following quantities are conserved for an isolated N-body system (i.e., they are time 
independent) when evaluated along with the solution of Eq. (6.1): 


N N 
E=T+U, P= So mivi, J= > mixi Avi, (6.3) 
i=l i=1 
where E, P, and J are the total energy, momentum, and angular momentum, respectively. 


Therefore, the general N-body problem also admits the same 10 classical integrals of 
motion already encountered in Chapter 4 in the study of the two-body problem. In So, 


N 
1 1 P 
Rcm = u y : miXi, Vom = u y mii = 5, (6.4) 
i= [= 


are the position and velocity of the center of mass. 

A particularly important reference system is the inertial barycentric reference frame 
Scm, with its origin located at Rcm, moving with constant velocity Vcm, and with axes 
parallel to the axes of So, where from a Galilean coordinate transformation 

x =Reom + x;, Vi = Vom+V;, (6.5) 
and so, while U = U’, it follows that 
T=Eqmt+T’, E=EmMt+eE’, J=Iemt+ J, P=Pcu, (6.6) 


where the quantities Poy’ = MVcm, Jom = MRcm A Vem, and Ecm = M ||Vemll?/2 are 
obviously time independent, and the meanings of E’, T’, and J’ are apparent; in Appendix 
A.1.1, the general transformation formulae of the quantities above for an arbitrary change 
of reference system are given. 

Due to the time independence of J, when J’ # 0, the plane defined in Scy4 as 


Tiny = {x € = (x, J’) = 0} (6.7) 


2 Note that the polar moment of inertia / = Tr(/;;) = Tr(3;;)/2 is half of the trace of the inertia tensor 3;; in Eq. (2.89). 
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is called the invariable plane, and obviously Rem € Uiny. Moreover, the polar moment of 
inertia in So is related to the correspondent quantity J’ as 


N 


1=MReMIP +2’, 1! = DY millxll? Ess mim jr. (6.8) 
i=} ee 1 


The last expression for I’, proposed by Lagrange (1736-1813), is of great importance in 
application, and in Exercise 6.2 the student is guided toward proving it. In Exercise 6.3, we 
will also prove the remarkable exact bounds that hold Vr 


Gu ; GM? 2 MI'(t) 
lao) <Trmin(t) < UO M < Tmax(t) < ae (6.9) 


Physically, the (inverse of the) gravitational energy is a “measure” of the minimum separa- 
tion of particles, while the square root of the barycentric moment of inertia is a “measure” 


of the maximum interparticle distance. 

We conclude this section with two important identities that hold in the generic inertial 
reference system Sg (and so in Scy), and these will be used in the following discussion. 
The first is the Lagrange—Jacobi identity 


;j 
5 SOT + US EFT =2E-U. (6.10) 


The second and third identities in Eq. (6.10) derive immediately from energy conservation 
(6.3), once the first identity is established. Due to its relevance to stellar dynamics, in 
Exercises 6.4—6.6 we will prove the Lagrange—Jacobi identity in three cases of increasing 
generality; in the last one we will prove a general tensorial identity for systems of N bodies 
interacting with central power-law forces in the presence of an external force field, and we 
show that its trace in the self-gravitating Newtonian case reduces to the first identity in 
Eq. (6.10). We will encounter Eq. (6.10) again in tensorial form in Chapter 10. The second 
result (see Exercise 6.7) is Sundman’s (weak) inequality (see Boccaletti and Pucacco 1996 
for the strong case) 


(Jl? < 217 =1( - 26). (6.11) 


The next three sections will focus on singularities, special solutions, and the long-term 
behavior of the N-body problem. These arguments are only of tangential interest for stellar 
dynamics; nevertheless, a qualitative comprehension thereof is important for a better under- 
standing of the (idealized) dynamical fate of stellar systems produced by stellar dynamics 
on (very long) timescales (see Chapters 7 and 8). As the celestial mechanics literature on 
the N-body problem is immense and highly technical, here we only provide the interested 
reader with a list of some of the most relevant references (with different degrees of dif- 
ficulty): Abraham and Marsden (1978), Arnold (1997), Boccaletti and Pucacco (1996), 
Hagihara (1970), Meyer and Hall (1992), Roy (2005), Szebehely (1967), Valtonen and 
Karttunen (2006), and Wintner (1947). 
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6.1.1 Singularities 


The investigation of the singularities of the ordinary differential equations (ODEs) describ- 
ing the N-body problem is an active field of research in celestial mechanics, with seminal 
contributions coming from to Poincaré (1854-1912) and Painlevé (1863-1933); fascinating 
accounts are given in the books by Barrow-Green (1997) and Diacu and Holmes (1996). 
In fact, one of the main problems posed by the nonlinear equations in (6.1) is that their 
solutions can develop movable singularities (i.e., singularities that depend on the specific 
choice of initial conditions x;(0) and v;(0) and that make the analysis of the problem 
extremely difficult). In fact, in contrast with the nonlinear ODEs, according to a theorem 
from Fuchs (1833-1902), the homogeneous /inear ODE of order 1 


PI tie PFO 
dx" + Dale) dx' = 0 dx® 


= y(x), (6.12) 
i=0 

where aj(x) are functions of the independent variable (see also Eq. (A.135)), can present 
only fixed singularities placed at the singularities of the coefficients a; that are independent 
of the initial conditions and that therefore can be determined independently of the abil- 
ity/possibility to find a solution of the equation itself. Moreover, the nature of the singular 
points (regular or essential) and the behavior of the solutions in their neighborhood can be 
determined by using Frobenius’ (1849-1917) theorem (e.g., see Jackson 1998; for detailed 
discussions, see also Arfken and Weber 2005; Bender and Orszag 1978; Ince 1927; Lomen 
and Mark 1988). An illustrative example of fixed and movable singularities’ is provided in 
Exercise 6.8. 

Therefore, at variance with linear problems, in the highly nonlinear N-body problem, it 
is natural to expect movable singularities in the particle orbits that take place at times that 
are dependent on the assigned initial conditions; a simple example is provided by the two- 
body problem (see Chapter 4) that for J’ 4 0 never develops a singularity, while for J’ = 0 
the two bodies collide at a finite time. Remarkably, there are theorems providing bounds of 
collision times in the N-body problem as a function of the initial conditions. Here, we only 
report a classification of the possible singularities: 


(1) Collisions: Lim tying Fon = 0: 

(2) Pseudo-collisions: lim inf; ting ‘min = O and lim sup,_, tsing “min > 0. 

(3) Blow—up: lim;-. tsing "max = CO, with fsing < 00. 

(4) For N = 3, all singularities are collisions. 

(5) If I’ = O(1) fort > tsing, then the singularity is a collision. 

(6) If the N particles are on a straight line, then all singularities are collisions. 
(7) A singularity at fing is a collision at and only if U = O[(t — baie 3 ]. 


3 We will encounter the concept of fixed singularities again in Appendix A.2 when discussing the properties of some of the most 
important special functions arising in potential theory. 
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(8) The set of initial conditions in phase space leading to a collision at a finite time has 
zero Lebesgue measure. 

(9) Weierstrass-Sundman theorem: the global collapse+ of an N-body system may occur 
only in a finite amount of time (i.e., fsing < -+-0o) and only if J’ = 0 (see Exercise 6.9). 


In particular, Point (8) shows that collisions in the N-body problem are exceedingly rare. 


6.1.2 Special Solutions 


Under particular circumstances (i.e., symmetries), the N-body problem can be solved 
explicitly. From what we have said before, these solutions are of great interest because 
they are the only ones where the celebrated problem can be investigated in full detail. The 
most famous cases concern the three-body problem and come from Euler (1707-1783) 
and Lagrange (1736-1813): these are not just mathematical curiosities, but have important 
astronomical counterparts (e.g., see Roy 2005). Here, only an illustrative overview is given, 
and we begin with some definitions. In the barycentric inertial frame Scy, a solution of the 
N-body problem is said to be: 


(1) planar if there exists a plane II with a time-independent orientation containing all of 
the bodies at any time; 

(2) flat if there exists a plane I1(t) containing all of the bodies at any time; 

(3) syzygial at time fo if at f = fo the N particles all lie on the same straight line; 

(4) rectilinear if there exists a straight line A with a time-independent orientation contain- 
ing all of the bodies at any time; 

(5) collinear if there exists a straight line A(f) containing all of the bodies at any time; 

(6) homographic if there exists a scalar function A(t) > 0 and an orthogonal matrix R € 
SO(3) so that Vr the position of each particle is related to its initial position by 


x(t) = AC)R(t)x?; (6.13) 


(7) homothetic, if in Eq. (6.13) R is the identity for all times; 

(8) relative equilibrium if in Eq. (6.13) A(t) = | for all times; 

(9) central configuration if there exists a scalar function o (t) > 0 so that fori = 1,...,N 
and Vt 


Vx;U = o(t)mix;. (6.14) 
In addition, the following results (no proof is given) hold: 


(1) if the solution is planar and J’ 4 0 and the plane I coincides with Tiny in Eq. (6.7); 
(2) a planar solution may exist even for J’ = 0; 


4 “Global collapse” means that all interparticle distances go simultaneously to zero (i.e., in Soy limp+7.., 1/(t) = 0 and 


U = —o0). 


sing 


lim; feing 
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(3) if J’ = 0, then any flat solution is planar; 

(4) every planar solution is flat; 

(5) not every flat solution is planar (e.g., the solution of the general three-body problem 
is always flat); 

(6) from Results (3) and (5), any solution of the three-body problem with J’ = 0 is planar; 

(7) if Iliny exists, then any syzygial configuration lies on Tiny; 

(8) every rectilinear solution is collinear; 

(9) every collinear solution is flat; 

(10) every collinear solution is planar: in fact, if J’ 4 0, from Result (7) the straight line 
A(t) lies on Tiny Vt, and so the problem is planar. If J’ = 0, from Results (3) and (9) 
the solution must be planar; 

(11) Laplace theorem: solutions of the N-body problem are central configurations if and 
only if they are homographic. 


As already mentioned, the most interesting solutions from a physical point of view are 
the homographic/central configurations, such as the Euler collinear and the Lagrange planar 
(equilateral) solutions (see also Appendix A.1.1). 


6.1.3 Long-Term Behavior 


Many exact and asymptotic results are known for the N-body problem when t — oo. Such 
theorems are highly technical in nature: the interested reader is again invited to consult the 
given list of references. In order to illustrate the nature of such results, we simply recall 
(see also Exercise 6.10) that in the barycentric system Scm: 


(1) For E’ < 0, a positive constant A exists so that rmin(t) < A Vt. 

(2) For E’ = 0, rmin(t) = O(t?/3) anda positive constant A exists so that rmax (t) = At2/3 
fort > ~. 

(3) For E’ > 0, rmin(t) = O(t) and a positive constant A exists so that rmax(t) > At for 
tf —> ©. 


Result (1) means that when E’ < 0 the minimum interparticle distance rmin(t) remains 
bounded, while nothing can be said about rmax(t) (i.e., at variance with the two-body 
problem, particles may escape from an N-body system with barycentric total negative 
energy; e.g., as happens in the phenomenon of gravitational evaporation of stars from open 
and globular clusters; see also Section 6.2.1 and in particular Binney and Tremaine 2008; 
Chandrasekhar 1942; Heggie and Hut 2003; Spitzer 1987 and references therein). Result 
(2) means that rmin(f) cannot grow faster than t?/3 when E’ = 0 (and of course it may 
well remain bounded); in contrast, rmax(t) cannot grow slower than 12/3 (i.e., at least one 
particle must escape in case of vanishing barycentric total energy). Finally, from Result (3) 
it follows that, when E’ > 0, rmin(t) cannot grow faster than f (and it may well remain 
bounded), while rmax(¢) cannot grow slower than f; at least one particle must escape from 
an N-body system with barycentric total positive energy. From the point of view of stellar 
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dynamics, the most important results can be summarized as follows: E’ < 0 is a necessary 
condition for all interparticle distances r;;(t) to be bounded at all times and E’ > Ois 
a sufficient condition to have at least one escaper. Therefore, the evolution of the system 
is somewhat determined by the sign of E’: a system with E’ > 0 will necessarily eject 
particles (in principle up to a complete dissolution) if after each escape the energy of the 
remaining system remains positive or until E’ < 0. From this time, the system can remain 
bounded or eject particles, with the limit case finally reducing to a pair of particles on a 
very tight orbit accounting for all of the initial negative energy and diminished by all of the 
kinetic energy carried away by the escapers. 


6.2 The Scalar Virial Theorem 


We are now in a position to introduce the concept of the virial. Due to its relevance to stellar 
dynamics, the virial identities will be derived again in Chapter 10 along a different path. At 
present, we restrict ourselves to the (second-order) scalar virial theorem for self-gravitating 
systems, and we begin our discussion by recalling that an N-body system is called self- 
gravitating if the total force under which the N particles move is determined by the masses 
themselves through Eq. (6.1). By definition, such a system is virialized if in the barycentric 
inertial reference system Scm 


0297 S=y.. vs, (6.15) 


In Eq. (6.15), the limit case of T’ = 0 is of little practical interest, formally corresponding 
to a degenerate configuration with all particles (stars) at infinite mutual separation and 
motionless. Notice that if a self-gravitating system is virialized, then its barycentric total 
energy cannot be positive; in other words, necessarily 


E’ <0, (6.16) 


because for a self-gravitating virialized N-body system the last two identities in Eq. (6.10) 
dictate that T’ = —E’ and U = 2E’. 

Therefore, from the Lagrange—Jacobi identity, it follows that if a system is at equilibrium 
I ' = 0), then it is necessarily virialized,? and that if a system is virialized, then Il =0. 
The last consideration seems to leave open the possibility that, after all, we could have 
a virialized system with I’ = const. 4 0; however, this is impossible because it can be 
proved (see Exercise 6.11) that a self-gravitating N-body system is virialized not only if 
and only if /’ = 0, but actually if and only if the stricter condition 

dI' 


—=0, Vt 6.17 
Th (6.17) 


5 Sometimes in the literature the redundant concept of “virial equilibrium” is encountered, suggesting the impossible case of a 
system at equilibrium but not virialized! 
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holds. Moreover (see Exercise 6.12), a self-gravitating N-body system is virialized if and 
only if the kinetic and potential energies are constant 


T'(t) = T'(0), U(t) = U(O), (6.18) 
and two positive constants A and B exist such that 
Asrj@Q<B, ViAj=l,...,N. (6.19) 


A more general definition of virialization, extending the concept to time-dependent sys- 
tems and so relaxing the strict coincidence of the concepts of equilibrium and virialization, 
can also be introduced (e.g., Landau and Lifshitz 1969). Let f(t) be a given function of 
time and define 


1 t 
(fir = -f f(t)dt. (6.20) 


If limyso0(f)+ exists, such a limit (f) is called the time-average of f. Now, for a self- 
gravitating N-body system 


ar’y=-(U) & IW @)=o(t) fort > oo. (6.21) 
In fact, the time-averaged Lagrange—Jacobi identity reads 


'a)-1O) | 


; 2(T')r + (U)s, (6.22) 


thus proving Eq. (6.21). Notice that if r (t) = o(t), then I(t) = o(t?) (asymptotic relations 
can always be integrated), and according to Eq. (6.9), rmax(t) = o(t) for t > oo. Point (3) 
in Section 6.1.3 then excludes the possibility of time-averaged virialization for systems 
with E’ > 0, a conclusion that can also be reached more simply from energy conservation, 
because (E’), = E’, and so in a system virialized in a time-averaged sense 


(T') = -E',  (U) = 2E’ (6.23) 


(i.e., E’ < 0). 

Of course, a virialized system is also virialized in a time-averaged sense, but the converse 
is not true; the classical examples are those of systems performing bounded and/or periodic 
motions, such as the negative energy case of the Lynden-Bell and Lynden-Bell (2004) 
N-body system (see Exercise 6.5) and the elliptical two-body problem, where from Eqs. 
(4.7) and (4.30) 


no AiMtme Ie ’ — ’ 
(T’) =G » (T) = AT"), (TT) = —(T). (6.24) 
a mt mf 


For a virialized system, the virial velocity dispersion oy and the virial radius ry are 
defined as 

2 2 

Moy GM 


=T' 
Die rv 


=|U\, (6.25) 
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so that in the new variables 

2T’ |U| GM 
M Mw 
and perfectly analogous expressions hold in a time-averaged sense. The student should 


appreciate that Eq. (6.26) does not add anything new to the identities (6.15) and (6.21), 
being simply a rewriting in terms of new variables.° In Chapter 13, we will compute oy 


2 _ 
ov = 


, (6.26) 


and ry for some of the most common models used to represent stellar systems. 

Before moving to the next sections, where we will consider a few important applications 
of the virial theorem to stellar dynamics, it is necessary to mention two fundamental prob- 
lems that arise immediately from the previous discussion. We anticipate that an enormous 
body of literature is available on these two subjects, and here, for reasons of space and 
technical difficulty (being well above the level of this book), we simply mention them and 
give some relevant references for further study. 

The first natural question is how a stellar system, initially out of equilibrium (e.g., a 
proto-galaxy during the epoch of galaxy formation or a pair of galaxies undergoing a 
merging in the local universe), reaches its equilibrium configuration, and how the structural 
and dynamical properties of the final state depend on the specific initial conditions. In stellar 
dynamics, the dynamical process leading to virialization is known as violent relaxation 
(Lynden-Bell 1967; Tremaine et al. 1986; see also Bertin 2014; Binney and Tremaine 
2008; Saslaw 1987). The proper way to discuss violent relaxation would be by using 
concepts from statistical mechanics: here, in very elementary terms, we simply notice that 
violent relaxation is not due to the effects of the star—-star gravitational interactions that 
will be discussed in the next two chapters, but to global fluctuations of the system’s time- 
dependent potential that redistribute the energies of the stars independently of their mass 
(see Footnote 8 in Appendix A). Moreover, violent relaxation acts on “short” timescales 
compared to other relaxation phenomena, of the order of a few dynamical times tyr ~ 
tayn © 1/ /Gp, where p is the average density of the system. The important message here 
is that theoretical and numerical works nicely show that the virialized end states of violent 
relaxation describe quite well the global structural and kinematic properties of real systems 
such as early-type galaxies (e.g., see Bertin and Stiavelli 1984; Bertin and Trenti 2003; 
Stiavelli and Bertin 1985, 1987; see also the end of Section 12.2.2; from the numerical 
point of view, see the seminal paper of van Albada 1982; see also Nipoti et al. 2006 
and references therein) and dark matter halos (e.g., Aguilar and Merritt 1990; Dubinski 
and Carlberg 1991; Navarro et al. 1997). Finally, and from a more general point of view, 
notice that violent relaxation somewhat depends on the specific nature of the gravitational 
force: there are systems such as the N-body system in which the particles interact with 


6 A comment is in order here. ry and oy are perfectly defined for the vast majority of the models encountered in stellar 
dynamics; however, there are a few important peculiar cases. For example, in the untruncated King (1972) model in Eq. 
(13.34) and in the untruncated Navarro et al. (1997) model in Eq. (13.40), the peculiar r3 density profile at large radii leads 
to a divergent total mass M but also to a finite U (see Exercise 13.7); for these models, ry = oo and oy = 0. Even more 
peculiar cases of ry and oy arise in systems with divergent M and U, the most famous being the singular isothermal sphere in 
Eq. (2.63). See Exercise 13.21 for a discussion. 


114 The N-Body Problem and the Virial Theorem 


the harmonic oscillator force and no phase mixing (e.g., see Binney and Tremaine 2008) 
can take place (see also Exercise 6.5), and there are other forces (e.g., modified Newtonian 
dynamics) where the virialization times are significantly longer than for the 1/r? force 
(e.g., see Di Cintio and Ciotti 2011; Di Cintio et al. 2013; Nipoti et al. 2007). 

Once we have a virialized stellar system, the second natural problem concerns its sta- 
bility, as not all equilibrium configurations are stable. Very sophisticated analytical and 
numerical techniques can be used to address this problem, which is of the utmost impor- 
tance in stellar dynamics, because not only can we expect that the vast majority of stellar 
systems we observe are (more or less) stable, but also instabilities (driven by internal or 
external phenomena) can be important for the evolution of these systems. An impressive 
number of results are known (e.g., see Bertin 2014; Binney and Tremaine 2008; Fridman 
and Poliachenko 1984; Palmer 1994). Among the very general and sufficiently simple 
stability criteria routinely used in stellar dynamics (see also Exercise 13.33), here we only 
mention the so-called radial orbit instability criterion (e.g., Bertin et al. 1994; see also 
Di Cintio et al. 2017; Nipoti et al. 2002, 2011; Polyachenko and Shukhman 1981; Saha 
1991 and references therein), stating that stellar systems with “too many” radial orbits 
are unstable, and in the case of stellar disks we mention the Toomre stability criterion 
(Toomre 1964; an extension to stellar dynamics of an analogous result from fluid dynamics 
for rotating fluid disks). We finally recall the global Ostriker—Peebles stability criterion 
(Ostriker and Peebles 1973) for disk galaxies that, combined with the observational flat 
rotation curves of HI at large radii in disk galaxies (e.g., see van Albada et al. 1985), 
provides very strong support to the existence of extended and massive dark matter halos 
around galaxies. 


6.2.1 The Virial Plane: Negative Specific Heat of Self-Gravitating Systems 


In this section, we illustrate how the virial theorem in its simplest form allows to derive 
fundamental physical properties of self-gravitating stellar systems, such as their negative 
specific heat and their slow collapse as a consequence “gravitational evaporation” of stars 
(see Chapter 7), finally leading to the so-called gravothermal catastrophe, one of the most 
important concepts for our understanding of the dynamical evolution of globular clusters 
(e.g., see Lynden-Bell and Eggleton 1980; Lynden-Bell and Wood 1968; see also Bertin 
2014; Binney and Tremaine 2008; Spitzer 1987). 

All of these phenomena can be qualitatively illustrated by using the virial plane in 
Figure 6.1. As we are dealing with self-gravitating gravitational systems, we plot on the 
horizontal axis the quantity |U|; therefore, points near the origin represent very “diluted” 
systems, while moving to the right we encouter more and more concentrated systems. 
On the vertical axis we represent the system’s kinetic energy: points at the bottom of the 
plane represent “cold” systems, while at increasing ordinate values the systems are more 
and more “hot.” We first ask what the position is of a virialized system in Figure 6.1. 
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Figure 6.1 The virial plane for self-gravitating systems with 1/ r? forces, where the locus of virialized 
systems (virial theorem (VT), heavy line) and a few energy conservation lines are represented for 
different values of the total energy. No virialized states exist for positive values of the (barycentric) 
total energy E. Systems are qualitatively indicated as hot, cold, diluted, and concentrated depending 
on their position in the plane. 


From Eq. (6.15), it follows that a self-gravitating system is virialized if and only if its 
representative point is placed on the virial theorem line 


_ ul 
al 


On the same plane we can now draw the family of straight lines representing energy 
conservation 


7 (6.27) 


T' =|U|+E’. (6.28) 


If a system evolves at constant E’, then its representative point can move only on the line 
fixed by the initial conditions. Obviously, the line T’ = |U| represents systems of zero total 
energy, parallel lines above represent unbound systems, while parallel lines below represent 
systems with total negative energy. The first important consequence that can be obtained 
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by inspection of Figure 6.1 is that only systems with total negative energy have a chance 
of beging virialized, as only the constant energy lines of this family have intesections with 
the virial theorem line, in agreement with the algebrical conclusion in Eq. (6.16). 

In practice, suppose we have a stellar system with given initial conditions, so that we 
can determine the initial position of its representative point in the virial plane. If the point 
is on the virial theorem line, then the system is in equilibrium and, if the equilibrium is 
stable, the point will remain there forever. If the total energy is positive and energy losses 
are forbidden, the point will necessarily move on its energy conservation line in Eq. (6.28) 
without having the possibility of virializing, as no intersections can occur with the virial 
theorem line. Finally, if the total initial energy is negative, one possibility — compatible 
with energy conservation — is that the point moves and finally stops (after the phase of 
violent relaxation) on the virial theorem line at the position consistent with its negative 
total energy. 

Suppose now that we start with a virialized system at Point | in Figure 6.1, and assume 
that a fraction of its kinetic energy is lost because for some reason a star is ejected,’ carring 
away its kinetic energy (such as in the case of gravitational evaporation, e.g., see Binney 
and Tremaine 2008; Chandrasekhar 1942, 1943b,c; Hénon 1960; Spitzer 1987), so that T’ 
is reduced, while U will be almost unaffected. Graphically, this is represented by the arrow 
moving down from Point | — Point 2; in practice, we “cooled” the system at fixed U. 
The system is now out of equilibrium, and its point cannot stay at Point 2: if the system 
virializes, the only place possible in the virial plane is at Point 3, corresponding to a more 
concentrated configuration and a higher “temperature.” In more suggestive language, we 
can say that the system “cooled,” but after virialization the self-gravity finally heated it 
to a greater temperature than the initial one. Of course, the opposite would happen along 
Path 1 — 4 — 5 in Figure 6.1. In a useful thermodynamic analogy, we can say that 
self-gravitating systems are characterized® by a negative specific heat (e.g., see Lynden- 
Bell and Wood 1968). What we have just described is the physical basis of the phenomenon 
of gravothermal catastrophe occurring in globular clusters — the progressive contraction of 
their inner regions associated with the continuous ejection of low-mass stars produced by 
the tendency toward equipartition (see Chapter 7). Unsurprisingly, similar considerations 
also apply to gas spheres. If we have an equilibrium gas sphere (e.g., a protostar) at a 
temperature higher than that of the environment, then the Second Law of Thermodynamics 
rules, and nothing can prevent the star from losing energy through radiation (i.e., having 
dE'/dt < 0). If the process is slow (i.e., if the Kelvin time tx, = E’/E' is much longer than 
the gravitational time 1 /./Gp), then the star configuration will readjust continuously, and 
macroscopically its representative point will climb along the virial theorem line, following 
the time derivative of Eq. (6.23) 


7 Of course, ina system with N > 3, such ejection is compatible with the system’s total barycentric negative energy E’ < 0 (see 
Section 6.1.3). 
Negative specific heats are not peculiarities of self-gravitating systems. For example, see Footnote 4 in Chapter 10 about 
polytropic transformations of a perfect gas. 
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d(T’) ‘dE’ d(U) _ jdE (6.29) 
dt ~— dt’ dt dt’ ; 


The protostar contracts and heats up. Of course, even if it grows hotter and hotter (until 


contraction stops when finally the central temperature is high enough to produce nuclear 
reactions? in the core), actually the protostar becomes poorer and poorer in total energy. 
Unfortunately, the two relations above are sometimes “read” in a quite inappropriate way, 
producing some difficulty for students. In fact, if we say that in a quasi-static contrac- 
tion “half of the change in gravitational energy must be radiated and half goes toward 
increasing the temperature” (a perfectly correct statement from the mathematical point 
of view!), then the physical weight of the argument is put on the gravitational energy.!° 
Instead, the contraction is driven by the energy loss dE’/dt, which with virialization is 
redistributed among kinetic and gravitational energies as in Eq. (6.29). In practice, the 
combined effects of gravity and the virial theorem behave like a quite strange person who 
own two bank accounts (T’ and U) and who for each expense (E’ < 0) not only charges it 
to the account U, but also takes from U the equivalent of the expense and transfers it to the 
account T’. Vice versa, for each gain (E’ > 0) the money is put into the account U and an 
equivalent amount is also transferred from T’ to U. 


6.2.2 Dissipationless Merging, Collapse, and the Virial Theorem 


We conclude this chapter with the qualitative (but far-reaching) illustration of the conse- 
quences of the virial theorem for the phenomenon of galaxy merging (i.e., the fusion of two 
initially distinct galaxies or dark matter halos; see Toomre 1977) and of structure collapse 
(i.e., the evolution toward equilibrium of an energetically bound gravitational system with 
no or little initial kinetic energy). The relevance of these phenomena to astrophysics is so 
obvious that no further emphasis is needed. 

For simplicity we restrict the discussion to the dissipationless case where only gravity 
is at play (i.e., we exclude the effects of energy gains and lossess due to baryonic physics) 
and we briefly comment on the interesting conclusions that can be obtained when simply 
considering the virial theorem and three of the most important empirical scaling laws 
obeyed by elliptical galaxies (e.g., see Bertin 2014; Binney and Tremaine 2008; Cimatti 
et al. 2019; Sparke and Gallagher 2007; see also Ciotti 2009 and references therein). 
The first of these relations is the Faber-Jackson law (Faber and Jackson 1976), which 
we write as 


* awr4, (6.30) 


9 Ina beautiful analogy, we can also say that the core contraction of globular clusters due to gravothermal catastrophe stops 
when the clusters start to “burn” the gravitational energy of binary stars in the core (see Binney and Tremaine 2008; Spitzer 
1987). 

10 There are cases when the physics is actually driven by U instead of E’, such as when a star contracts due to the increase in the 
mean molecular weight in the core produced by nuclear burning — and due to the correspondent decrease in central pressure. 
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where o, indicates the observed value of the velocity dispersion of a given elliptical galaxy 
(measured within some prescribed aperture; see Chapter 13) and L is its total (absolute) 
luminosity in some prescribed photometric band. Basically, this extremely important rela- 
tion says that in elliptical galaxies the stellar velocity dispersion increases with their total 
luminosity, something the student should recognize as not obvious.!! The second important 
empirical scaling law is due to Kormendy (1977), 


Loa Rf, BS1 (6.31) 


(in Chapter 13 we will encounter another very important scaling law, the fundamental 
plane, which contains and encompasses the two previous laws). The third and final family 
of scaling laws we consider relates the mass and/or velocity dispersion of the host stellar 
spheroids and the mass Mpgy of their central supermassive black holes 


Mpy « M,, Mpy «x al, yr4+5 (6.32) 


(e.g., see Ferrarese and Merritt 2000; Graham 2016; Gebhardt et al. 2000; Magorrian et al. 
1998; Merritt and Ferrarese 2001; Tremaine et al. 2002). 

For our idealized experiment, we consider two self-gravitating, virialized stellar systems 
with initial separation much larger than their sizes, so that 


Ey, = —Ty = 2Uj, Eig = —Tiz = 2U 2. (6.33) 


For sake of generality, we also consider some extra energy AE (negative or positive) in 
the initial conditions, mimicking, for instance, the merging of a bound pair or the fusion 
of two stellar systems in a slightly hyperbolic orbit. During the merging, we assume that 
no mass is ejected, and consistent with the assumptions we enforce energy conservation 
Es = Ei + Ei2 + AE; finally, we impose the virialization of the end product, with Er; = 
—T; = 2U; and, in terms of kinetic energies, 


Moy, + M204, 2AE 

M, + M2 M,+ M2 
Therefore, in a case of parabolic merging, ovr cannot be larger than the maximum velocity 
dispersion of the progenitors; for bound pairs ov¢ can increase, while for hyperbolic merg- 
ings ovr decreases. If we focus on potential energy instead of kinetic energy, we obtain the 
analogous relation for the final virial radius 


Ty = —Ep = Ty + Ty — AE > 04, = 


(6.34) 


Et AE 1 M?/ry, + M3 /rv2 AE 
Up = > = Uj + Ui2 + >—=— 2 7 
2 2 've (M, + M2) 2G(M, + M2) 
(6.35) 


so that ry will shrink for AF < 0 and significantly expand for AE > 0. In particular, note 
that in the idealized case of parabolic merging of indentical systems ovr remains identical 


1 For example, if the size of bright galaxies had been much larger than it is, their velocity dispersion would have been much 
lower than observed; see Eq. (6.26). 
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to that of the progenitors, while ry doubles. It is elementary to conclude that even though 
ov and ry are related to the observed velocity dispersion o, and to the effective radius 
R, by nontrivial nonhomology effects (see Chapter 13), repeated merging poses important 
constraints on galaxy formation scenarios, in particular due to the difficulty of increasing 
the velocity dispersion and the excessive growth of the virial radius with the galaxy mass 
as required by Eqs. (6.30)-(6.32). These considerations add to the list of points given in 
the classical reference by Ostriker (1980). Of course, gas dissipation helps to increase the 
velocity dispersion and radius contraction (e.g., see Ciotti and van Albada 2001; Ciotti et al. 
2007; Gonzalez-Garcia and van Albada 2003; Nipoti et al. 2003). 

In Exercise 6.13, we will discuss the conclusions that can be reached on the problem of 
the collapse of a dissipationless, gravitationally bound structure with “cold” initial condi- 
tions; this simple computation is relevant for explaining why the virial velocity dispersion 
of dark matter halos formed in cosmological simulations increases with the halo mass, 
something that does not happen in the too-simplistic scenario where large halos form 
simply by the “merging” of similar small halos (e.g., see Lanzoni et al. 2004; Nipoti 
et al. 2006). The formation of dark matter halos should be more appropriately seen as 
the collapse (and homogenization) of initially “cold” and inhomogeneous structures: the 
primordial density fluctuations. 


Exercises 


6.1 By using Eq. (6.1), prove that in a generic inertial reference system So, dE /dt = 0, 
dP/dt = 0 and dJ/dt = 0. Deduce that Vom, Ecm, and Jcm are time-independent 
quantities and so that the general N-body problem admits the 10 classical integrals 
of motion already encountered for the two-body problem in Chapter 4. 

6.2 Prove Eq. (6.8). Hints: Note that the scalar r;; is invariant for the translation of the 
origin from Sp to Scm. First, by using Eq. (6.5), prove that ee mir; =l'+ 
M IIx) \|?, then multiply both sides by m j and sum over j. 

6.3. Show that the inequalities in Eq. (6.9) hold. Hints: For the first inequality, consider 


, (6.36) 


N 
G mim; Gmjm j Gu 
Wl=s> >> —*e > 
ris Peni Fos 
i#j=l ij min min 
where mj; and mj represent (one of) the pair of particles at a minimum relative 
distance; then 


N N N 
G mim j G G GM? 
|U| = A< mim; < mim; = . (6.37) 
2 », Vij 2rmin 22 Be Oe x ee Fee 
For the second inequality 
N 2 N 2 
Mr-., 
l= 7] mim jj; < pe mim; = a (6.38) 


i,j=l i,j=l 
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and 
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Prove the scalar Lagrange—Jacobi identity (6.10) directly from Eq. (6.2). Hint: First 
obtain J = 2 x i mj ((X;,Xi) + (X;,X;)), and then from Eq. (6.1) show that 


al GS mim; 
W =) milxi.X) a = =U. (6.40) 
f=] iA#j=1 , 
With this exercise, we extend our elaboration on the Lagrange—Jacobi indentity 
(6.10). The particles of a system interact with the force law 


xj — Xj xj — Xj 
B+1 : 
ij ij 


rij =(xi-xjl, iAJZ, (641) 


so that fora #¢ B and A- B F 0, ||F;;|| is not a power-law. From Exercise 6.4, first 
show that 


N 
1 A B 
W= 2 a mM; pont + “poi . (6.42) 
i#j=l ij "ij 


and compute W for the Newtonian force (B,a) = (0,2), for the harmonic oscillator 
(B,a) = (0, —1), and for the case (B,a) = (0, 1). Then, show that the “gravitational” 
energy of the system is given by U = Ug + Ug, where 


fo GF Se 6.43 
ce =) a a a ars mee 

for a # | anda = 1, respectively. Show that fora ¢ | and 6 ¥ 1, 
W = (a — 1)Uq + (B — Ug. (6.44) 


For a very remarkable application of the Lagrange—Jacobi indentity obtained when 
(a, 6) = (—1,3), see Lynden-Bell and Lynden-Bell (2004); for other applications, 
see Ferronsky et al. (2011). 

With this exercise, we conclude our discussion of the Lagrange—Jacobi identity (6.10) 
and we obtain its generalization in tensorial form for an N-body system of particles 
interacting with central forces also in presence of an external force field F'. Let 
F(kl) = —F(k), the force exerted by the mass m(/) on the mass m(k), so that for the 
particle k 


N 
m(k)&(k) =F) + SFU). (6.45) 
1=1;14¢k 


6.7 


6.8 
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First, introduce the two tensors 


N 


N 
1 
ij = So m(k)xi (k)xj(k), Ti = 5 So mk) xi (k)x;(k), (6.46) 


k=1 k=1 


and consider the second-order time derivative of the tensor J;;. From Eq. (6.45), show 
that 


e Le 
3 = 20+ S> {bile — 1 OLF (KD) + xj) — xj OF KD} 
IAk=1 
i N 
+5 DLO FM © + xj OFM OO], (6.47) 


k=1 


where the first double sum on the right-hand side is the symmetric tensor W;;. Then, 
show that the trace of the previous identity is 


N N 
t_ ext _i 
[7 2T+W+ LOE (k)), W= 5 2 (6.48) 


where W = Tr(W;;), 1 = Tr(i;) is the polar moment of inertia in Eq. (6.2), and 
r(kl) = x(k) — x(/). Finally, by using the results from Exercise 6.5, compute W for 
the forces 

r(kl) 


r(kl) = |r), kA, (6.49) 
and prove that Eq. (6.10) is recovered for F°**' = 0, A = —G, and a = 2. 


Prove the (weak) Sundman’s inequality ||J|* < 2/7 in Eq. (6.11). Hint: In a generic 
inertial reference system So, 


N N N 
JU = Do max; A vill < do millx; A vill < D5 mallxill + [vill 
i=l i=l i=l 


N N N 
=o vmilxill Vmillvill s| So millxi? | SmilivlP, 6.50) 
i=1 i=1 i=l 

where the chain of inequalities follows from the Cauchy—Schwarz and triangular 
inequalities in Eqs. (A.6) and (A.8). 

Here, with the aid of elementary differential equations amenable to explicit solutions, 


we illustrate the concepts of fixed and movable singularities. Let us consider a linear 
ODE as in Eq. (6.12) with 


aff 


ig tata =) LO= fo (6.51) 
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Show that the solution is 
x 
f(x) = fo exp (4) . (6.52) 


The (essential) singularity occurs at x = 1, the pole of the coefficient is aj, and 
its location is independent of fp, as expected. Solve again Eq. (6.52), where now 
dg = 1/(1 — x) and the singularity is regular. Finally, consider the nonlinear ODE 


d 
a ~~ =0, fO=h (6.53) 
x 
and show that the solution is 
_ fo 
a ee a (6.54) 


The movable singularity occurs at x = 1/fo (i.e., its location depends on fo). 

Prove the Weierstrass—Sundman theorem in Section 6.1.1. Hints: By definition of 
global collapse (see Footnote 3), lim;—,, U = —oo, and so lim; ring I’ = +00 
from the Lagrange—Jacobi identity (6.10). We first prove that a global collapse must 
occur at a finite time. In fact, the divergence implies that a time t, exists so that 
I' > 2fort > ty (ie, I(t) > 1? + at + B) if tsing = +00, then lim, +4,.n5 I' = +00, 
which goes against the hypothesis of global collapse. The second part of the theorem 
can be proved as follows: first, for t € [t+,fsing] from the definition of collapse 
we have I’ < 0, while again from limy— iin. I’ = +00 one concludes that J’ is a 
monotonically increasing function of time (i.e.,0 > I/(t) > I/(t,) and so I'(t,)* > 
I'(t,)? — I'(t)? > 0). After multiplication of the Sundman inequality (6.11) in 
S’ by the (positive) quantity —/’/7’, and after integration over ty < ft < tsing, 
we have |J'||? In{7’(t4)/1'()] < U'(t4)? — 71/2 + 2B — I(t) Ge. 
ims ting \|J’|| = 0). But the total angular momentum is conserved, and so J’ = 0. 
Prove Result | and the second statement of Result 3 in Section 6.1.3. Hints: Regard- 
ing Result 1, if E’ < 0, then |U| > |E’| from Eq. (6.3), and from Eq. (6.9) we 
have rmin(t) < 0.5GM7/|E’|. Regarding Result 3, from the Lagrange—Jacobi identity 
(6.10) with E’ > 0, it follows that 1’ = 2(T' + E’) > E’ Vt, and so I’ > at? for 
t + oo and some a > 0. Equation (6.9) again proves the statement. 

Prove Eq. (6.17). Hints: The sufficient condition is a trivial application of the 
Lagrange—Jacobi identity (6.10). The necessary condition is again proved using 
the Lagrange—Jacobi identity: from /’ = 0, it follows that I/(t) = I’(0)t + I'(0). If 
I'(0) = 0, then the result is proved; if 1'(0) 4 0, then for t > oo or t ~ —oo we 
would reach the impossible conclusion that I’(t) — —oo. 

Prove that Eqs. (6.18) and (6.19) give necessary and sufficient conditions for virial- 
ization. Hints: First, consider necessity. If the system is virialized, then T’ = —E’ 
and U = 2E’, so that Eq. (6.18) holds. Moreover, fori # j, 2\|E’| = |U| > 
Gmym j/rij => Gy? /rij, and so in Eq. (6.19) A = 0.5Gy?/|E"|; in addition, from 
Eq. (6.17), 1'(0) = I'@) = mimjrj,/M > y?r7,/M, and so B = JT(O)M/p. 
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For the sufficient part, by integrating the Lagrange—Jacobi identity twice under the 
assumptions in Eqs. (6.18) and (6.19), we conclude that I’(t) = [27’(0) + U (0) ]t? + 
ju (0)t + I'(O) is a bounded function, so that the coefficients of the time-dependent 
part of I’(t) are zero (i.e., Z(t) = I’(0)), and Eq. (6.17) completes the proof. 

With this exercise, we evaluate the increase of oy after virialization of a dissipation- 
less, self-gravitating, collapsing bound structure: this process illustrates qualitatively 
why the velocity dispersion of dark matter halos obtained in numerical simulations 
increases with the halo mass. Let Uj be the initial gravitational energy of the system 
and let 27; = —aU;, with 0 < a < 1, be its initial kinetic energy, so that a measures 
the “coldness” of the initial conditions. From energy conservation and the virial theo- 
rem, show that the final kinetic energy, in the absence of significant mass ejection, is 


T=-(1-3) i= (2-1) fi. (6.55) 


Compute the variation of oy from the initial conditions to the final state and deduce 
that an increase of oy in the mass of the system requires that more massive structures 
are initally more bound (per unit mass) than low-mass systems. 


7 
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Two-Body Relaxation 


In this chapter, we introduce one of the fundamental and most far-reaching concepts in 
stellar dynamics (and of plasma physics for the case of electric forces): that of “gravitational 
collisions.” As an application of the developed framework, the two-body relaxation time is 
derived (using the Chandrasekhar approach) by using the so-called impulsive approxima- 
tion. The concepts of the Coulomb logarithm and of infrared and ultraviolet divergence 
are elucidated, with an emphasis on the importance of the correct treatment of angular 
momentum for collisions with small impact parameters, an aspect that is sometimes puz- 
zling for the students due to presentations in which the minimum impact parameter appears 
as something to be put into the theory “by hand.” On the basis of the quantitative tools 
devised in this chapter, we will show that large stellar systems, such as elliptical galax- 
ies, should be considered primarily as collisionless, while smaller systems, such as small 
globular clusters and open clusters, exhibit collisional behavior. These different regimes 
are rich with astrophysical consequences, both from the observational and the theoretical 
points of view. 


7.1 The Granular Nature of Stellar Systems 


How much of a star’s orbit in a stellar system depends on the forces acting upon it due to 
more or less nearby stars? When we model a stellar system, can we ignore the fact that 
the true gravitational field experienced by each star is described by the discrete system of 
Eq. (6.1)? Or are we doomed to deal with the almost insurmountable difficulties of the 
N-body problem? 

In order to answer these fundamental questions, we ideally replace the discrete (dis- 
continuous) distribution of the N masses with a smooth (continuous) density distribution 
p(x;t); at this stage, nothing is specified about how such smoothing is performed. In prac- 
tice, the density distribution p, taken to be a satisfactory description of the true “granular” 
N-body system, will serve as a continuous model for the real system, and the associated 
smooth potential ¢@ (in general time dependent) is given by Eqs. (2.4)-(2.6). As long as 
this continuous approximation holds, the motion of each star is determined by the smooth 
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potential (x; f), and so the dimensionality of the phase space! is reduced from 6N to 6 
(but in the last case an explicit time dependence can appear), and the mathematical problem 
is enormously simplified. Of course, the stellar orbits in the smoothed system are only 
approximations of the real orbits, because in reality each star (e.g., the i-th star) actually 
moves in the true potential 


N 

drue(X.t)=-G >> 
j=L jAi 

associated with the gravitational field in Eq. (6.1). 
The problem we face can thus be summarized by the following question: How large is 
the difference between the fictitious orbit described by a given star under the influence of 
¢ and the true orbit described under the action of ¢iye? Or, stated differently, given that 
Pirue = 6 + (Pirue — &), how long will it take for the “granularity” term ¢irye — & to change 
significantly the orbit that would be performed by each star in the smooth potential ¢? Of 
course, the exact answer to such a question would require knowledge of both orbits, and 
this is precisely the difficulty we would like to avoid! Therefore, we must find some clever 
way to obtain a quantitative estimate of this time without resorting to the solution of the 
N-body problem. This problem is of such great importance to physics that different 
approaches have been developed to answer it (e.g., see Binney and Tremaine 2008). In 
the following, we will use what is perhaps the simplest, approach at least in terms of 
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visualization. 

We first introduce a few basic definitions that are needed to proceed with our discussion, 
and these will also be useful for the rest of this book. For an N-body system made for 
simplicity of identical stars of the same mass m, the nowhere negative function n = 
n(x, V;f) is defined by the property that at each time ¢ 


AN(Q;t) = i n(x, v; t)d°xd°y, G2) 
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which is the number of stars contained in a given arbitrary six-dimensional region Q C y of 
phase-space y; obviously, over the whole phase space AN(y;t) = N. A function strictly 
related to n is the phase-space distribution function (DF) f = f(x, v;t) = mn(x,v;1t), so 
that from the previous definition 


AM(Q;t) = i f (xv; t)d?xd3v, (7.3) 
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where AM is the total mass of the stars that at time ¢ are found in the region Q. Of course, 
AM(y;t) = M = mN is the total mass of the system. Note that in an N-body system 
made of k = 1,...,K different species of stars of masses mz, with N = ame N; and 
M=>- fae m, Nx, the functions nz and f; are naturally introduced as 


! We indicate with I the 6N dimensional phase space of the true system, and with y the six-dimensional (one-particle) phase 
space of a single star (see also Chapter 9). 
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Of course, it is also possible to extend the relations above to the case of a continuous dis- 
tribution of masses (obtaining a so-called extended DF formalism), and the sums would be 
replaced by integrals over the mass spectrum (e.g., see Chapter 8). A completely analogous 
set of definitions also holds for the smooth density distribution p. In this case, the relation 
between the material density at x and the DF is obtained by considering in Eq. (7.3) the 

phase-space region Q = A*x x 3, so that for a sufficiently regular f 
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as the integration over all of the velocity space ensures that we are certainly “counting” all 
of the particles in the “volume” d°x. Finally, if the system consists of K different density 
components px, 
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For the sake of the present discussion, we simply assume that the function f exists as a 
useful mathematical object that contains all of the needed dynamical information about 
each star of the system under consideration. We will discuss in more depth the physical 
meaning and the mathematical properties of the DF in Chapter 9. 


7.2 The Impulsive Approximation 


The standard approach to the calculation of the so-called relaxation time (i.e., the charac- 
teristic timescale beyond which the continuous approximation introduced in Section 7.1 is 
certainly no longer valid) is based on two assumptions: 


(1) All encounters between particles are treated as independent (i.e., the effect of each 
encounter is simply added to that of the others). 
(2) All encounters between particles are treated as hyperbolic two-body encounters. 


For these reasons, the derived characteristic time is called the two-body relaxation time, 
and it is indicated as fop. A gravitational system for which the cumulative effects of the 
encounters are negligible will be called collisionless, or, otherwise, collisional. Obviously, 
real N-body systems are never, strictly speaking, collisionless. Nevertheless, we will show 
that models based on the equations derived under the assumptions of perfect noncollisional- 
ity play an important role in stellar dynamics. The general treatment of the evaluation of fap 
can be found in well-known monographs (e.g., see Bertin 2014; Binney and Tremaine 2008; 


2 Notice that the physical dimensions of the DF are those of a density divided by a velocity cubed. 
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Chandrasekhar 1942; Saslaw 1987; Spitzer 1987; and for a different approach based on the 
statistical properties of the gravitational field, see Chandrasekhar and von Neumann 1942, 
1943). Here, we follow the simplest and more intuitive approach based on Assumptions (1) 
and (2) above, and for simplicity we further restrict ourselves to the case where all of the 
field particles have the same mass. 

We describe our N-body system of total mass M in an inertial reference system So 
with the aid of its phase-space DF f = f (xg, ve); the particles (stars) that make up the 
system have identical masses m¢ (where “f” stands for “field’’). Moreover, we consider a 
“test” mass m;, that at t = —oo has initial position x;(—oo) and initial velocity v;.(—0o), so 
that the total barycentric energy of each pair is positive and Assumption (2) is therefore 
satisfied. Following Chapter 4, for each pair (m;,mf), let yz be the reduced mass and 
r(t) = x(t) — x(t) and v(t) = y(t) — ve(t), respectively, the relative position and the 
relative velocity of m, at time t with respect to a given mr. Moreover, let v(00) be the 
asymptotic relative velocity of the two particles in the resulting two-body hyperbolic motion 
(i.e., their relative velocities for t — +00, respectively), let Av = v(oo) — v(—ox) be the 
total variation of the relative velocity (see Figure 4.1), and finally let J’ = pr(t) A v(t) 
and E’ = yllv(t)||7/2 — Gmyme,/||r(t)|| be the total (constant) angular momentum and 
total energy of the pair, respectively, in the inertial barycentric reference system Scyy of the 
pair. As is shown in Chapter 4, the impact parameter b of the resulting orbit is given by 


Eq. (4.24), ||v(oo)|| = ||v(—oco)||, and the final relative velocity v(oo) rotates with respect 
to the direction of the initial relative velocity v(—oo) by an angle 2Ag — zr, obtained from 
Eq. (4.25). 


We are now in a position to quantify the concept of the two-body relaxation time fp. 
The basic idea behind Assumption (1) is to compute the sum of the effects of all of 
the two-body encounters evaluated in the fictitious case of the absence of the remaining 
N — 2 stars and to define some threshold so that when the total effect is larger than the 
threshold, a relaxation time has passed. The student should appreciate that actually nothing 
special abruptly happens when the system’s age crosses the relaxation time, since from the 
beginning the granularity of the gravitational system affects the orbit of each star, such that 
as time increases these orbits can be described less and less accurately according to the 
corresponding orbits in the idealized smoothed system. Different indicators can be used 
to define the cumulative effects of the two-body interactions and to derive an estimate for 
top: for example, Chandrasekhar (1942) adopted a suitably defined average value of the 
cumulative angular orbital deflections or alternatively the variation of the kinetic energy 
of the test particle. Other possibilities can be devised, such as by computing the so-called 
diffusion coefficients in the velocity space and their associated diffusion (relaxation) times 
(e.g., see Binney and Tremaine 2008, appendix L; Rosenbluth et al. 1957; Spitzer 1987). 
Here, we will follow the simplest approach (i.e., we will focus on the kinetic energy varia- 
tions associated with the perpendicular component of the velocity of the test particle with 
respect to the initial relative velocity v(—oo)). This last point deserves special attention 
because its meaning is not as obvious as it may appear from a superficial exposition. 
In fact, as we are considering the phenomenon in the inertial reference system So, one 
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could think that the natural decomposition of the final velocity v;(-++oo) of the test star 
after each collision would be along the parallel and perpendicular directions with respect 
to the initial velocity v;(—0oo). However, this is not the case: as we will see, the natural 
decomposition is instead along and perpendicular to the initial relative velocity v(—oo). 
This may appear quite strange, as in each encounter v(—oo) is obviously oriented along 
different directions in So! 

So let us start by focusing our attention on a single encounter event; then we will 
add some (suitably choosen) properties of each of the (hypothetically independent) two- 
body encounters of the test star with the field stars in So by using the DF of the system. 
Examples of such properties are represented by (but not limited to) the components of the 
total velocity change Ay; of the test star in an unbound orbit in So as given rigorously by 
Eq. (4.16) and/or by the associated change of energy AE; in Eqs. (4.18) and (4.27) that in 
the particular case of the r~ force can be obtained in closed form by using the solution of 
the hyperbolic two-body problem in Eqs. (4.14), (4.15), and (4.33). 

In order to sum a preferred orbital property P of the test star over all of the encounters, 
we must first specify the phase-space DF of the field stars, as it contains the “initial con- 
ditions” of the field stars to be used in each encounter. This DF is usually written under 
the simplifying assumptions of a spatially homogeneous distribution of the field masses 
(i.e., the number density m¢ is independent of position) and in the case of isotropy of their 
velocity distribution. In practice, the phase-space density distribution of the field stars is 
given by 
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and g is a positive function dependent (as a consequence of velocity isotropy) on the 
modulus of vr; the normalization of the last integral is an obvious consequence of Eq. 
(7.5). Simple geometry shows that, under the previous assumptions, in the time interval 
At the number of encounters with field particles in the differential velocity volume d*v; 
around vr, and with impact parameter between b and b + db, is 


Anenc = 2bdb x |\v_ — vel| At x neg (vp)d vr. (7.8) 


The content of the expression in Eq. (7.8) is deeper than its deceptively simple algebraic 
appearance. In fact, even if the direction of v;(—o) in So is fixed for all of the encounters, 
the direction of the axis of the cylinder of radius b is oriented in different directions for 
different field stars, and also its length is different, as these quantities are dependent on 
the direction of the initial relative velocity of the test star with respect to the field stars in 
the velocity volume d*vs; in particular, note that Eq. (7.8) holds because n¢ is independent 
of xp. Therefore, while the test star is moving in a given direction in So, the student should 
imagine that it is actually traveling inside a sea of cylinders with all possible orientations 
and lengths in the relative velocity space. Now, for a given orbital property P of the test 
star, in full generality we introduce the diffusion coefficient 


7.2 The Impulsive Approximation 129 


D(P) = 2nn¢5 / 
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where (P) is the angle average of P around the direction of the initial relative velocity. In 
practice, D(P) measures the time rate of change of the orbital property P as a result of 
the cumulative effects of the encounters with the field stars (e.g., see Binney and Tremaine 
2008, appendix L, and references therein). 

Therefore, suppose we solved the two-body problem of a given encounter, and so we 
have the expressions of Av, and Av, for the relative orbit, and then from Eq. (4.16) 
we have the exact expressions of 


Avi, = = Av, AV = Ay (7.10) 
t t 


in So. At the risk of repeating ourselves, we recall one more time that the two last quantities 
are the components of the total variation of the test star velocity v; perpendicular and 
parallel to the initial relative velocity, not to v;(—00). 

We begin with a discussion of the diffusion coefficient D(Av;, ) of the perpendicular 
velocity component. It should be quite obvious that under the assumptions of Eq. (7.7), the 
angle (Av_) vanishes, and so we deduce that in an homogeneous and isotropic distribution 
of field stars, D(Av,) = D(Ay;, ) = 0; it can be shown that this result is also true when 
considering the case of the perpendicular component to the initial velocity v;,(—0o) in So. 
In the next chapter, we study Av; and we will show that this quantity is not zero, even in 
our homogeneous and isotropic field distribution. 

Having dealt with the case of the diffusion coefficients of the first-order velocity com- 
ponents, we now can move to the most basic second-order quantities (i.e., D(||Av;, \|7) 
and D(|| Avi \|7)). So we now consider the sum of the squared norm of each velocity 
deflection. Of course, these sums by construction are not zero. Before proceeding with 
the computation, it is worth understanding the physical meaning of this. Basically, the 
student should recognize that for a quantity such as Ay;, , having a zero mean value is 
not inconsistent with having a finite dispersion. The visualization of this is that, as the 
test particle travels in the field distribution, its mean direction remains unchanged, but a 
larger and larger conus of possible positions is associated with the average trajectory. In 
some sense this is equivalent to Brownian motion, such that the statistical average position 
of the particle remains the initial position, but the probability of finding the particle at 
a given distance from the intial position increases as the rate of the square root of the 
number of the scattering events over the fluid molecules (e.g., see Feynman et al. 1977, 
vol. 1, chapter 41). 

We first consider the diffusion coefficient D(|| Av, ||7): once obtained, we also have 
D(\Ave, ||?) = u?D(||Av||7)/m? from the exact Eq. (7.10). The integration of Eq. (7.9) 
presents some important conceptual problems. In Exercise 7.1, we proceed with the “exact” 
integration of D(|| Av, ||*), while here we follow the approach of impulsive approximation, 
which is often adopted in first expositions of the subject to students due to its use for 
presenting the basic physical arguments. 
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In each encounter of our test star with a field star, in the limit of a large impact parameter 
(i.e., high angular momentum), the change in the relative velocity perpendicular to the 
initial relative velocity v(—oo) is 


as v = |Iv(—00)|I. (7.11) 
v 

As can be seen by expansion of Eq. (4.33), the expression in Eq. (7.11) is asymptotically 
correct in the limit of a large impact parameter b or a large initial relative velocity v (see also 
Exercises 4.8 and 4.9). For practical purposes, the content of Eq. (7.11) can be described 
as the total deviation of the relative orbit obtained under the action of a force Gm ym¢/ b? 
acting in the perpendicular direction to the initial relative velocity for a time 2b/v, and from 
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this interpretation it is called impulsive approximation. Later on we will see why restricting 
ourselves to distant encounters allows us to obtain the correct (asymptotic) result after all. 
From Egs. (4.16) and (7.11), it follows that 


2.2 
G*m; 


2 Ww 2 
Ave = sllAvillr ~ 
mM 
Note how for large impact parameters there is no dependence on m;. Equation (7.9) then 
would reduce to 
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and it is apparent how integration over the impact parameter would lead to a doubly absurd 
result, with a logarithmic divergence both at b = 0 and at b = on. In particular, in the 
impulsive approximation an artificial divergence (the so-called ultraviolet divergence, sim- 
ilar to the analogous problem faced when computing the high-frequency component of 
the bremsstrahlung radiation emitted by an accelerated charge in plasma physics; e.g., see 
chapter 9 in Landau and Lifshitz 1971) appears for b = 0; however, when using the full 
solution of the two-body problem in Chapter 4, it is easy to show that such divergence 
disappears (see Exercise 7.1), and that the correct behavior of the integral near b = 0 is 
obtained by replacing the true limit in Eq. (7.13) with bz /2. Instead, the logarithmic infrared 
divergence for b — o cannot be eliminated in an infinite homogeneous gravitational 
system (while in the electromagnetic analog this divergence is not present, being the upper 
limit of integration that is naturally truncated at the Debye length). In practice, we are forced 
to truncate? the integral at bmax, a fiducial maximum impact parameter (e.g., see Binney 
and Tremaine 2008; Chandrasekhar 1942; Kandrup 1980; Spitzer 1987; Woltjer 1967. Also 
notice that even though it should be clear from the previous discussion that the question 
of the “exact” value of Dmax in a infinite and homogeneous system is strictly speaking 
ill-posed, some discussion on the optimal truncation of the Coulomb logarithm can be 


3 The infrared divergence shows that the cumulative effect of distant encounters dominates the two-body relaxation of 
three-dimensional systems, and this provides an “a posteriori” justification of the use of impulsive approximation. As an 
interesting exercise, the student should consider two-body relaxation in an idealized infinitesimally thin disk galaxy. For the 
case of modified Newtonian dynamics, where the Coulomb logarithm does not exist and the two-body relaxation is even faster 
than in Newtonian gravity, see Ciotti and Binney (2004). 
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found in the literature, such as the different points of view of Chandrasekhar and Spitzer 
on the subject, as summarized for example in Binney 1996 and van Albada and Szomoru 
2020). The final result can be expressed by introducing the Coulomb logarithm \n A, where 
the quantity A is given in Eq. (7.25). After integration over the impact parameter, Eq. (7.13) 
becomes 


b max 
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We finally integrate over the velocity space. Following Chandrasekhar (1942), we introduce 
the velocity-weighted Coulomb logarithm In A, 
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where In A in practical applications is estimated by using some average (typical) velocity 
vr of the field stars, and 
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in analogy with the result regarding the expression of the potential of a spherical density 
distribution.* In particular, 
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is the fractional velocity 14, analogous to the “mass” of a spherical system inside a sphere 
of “radius” 1. Therefore, the integral in Eq. (7.17) can be formally interpreted as the 
“potential” Y(v,) in velocity space of the “density” distribution g(v¢). Y(v_) is known as 
the first Rosenbluth potential, and it clearly depends on the specific velocity distribution of 
the field particles. The normalization condition on g in Eqs. (7.7) and (7.17) shows that for 
large values of vy; with respect to the characteristic velocities of the field stars, the potential 
is dominated by the “monopole” term 1/1; (e.g., see Exercise 7.3). In summary, we finally 
obtain the important formula 


D(\|Avi, ||?) ~ 82 G2ngm? In AW (vy), (7.18) 


in perfect agreement with Eq. (7.26). 

The cumulative effect of the encounters is to heat up the test particle in the direction 
perpendicular to the initial relative velocity. The characteristic time associated with this 
“heating” is defined in a natural way as 
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the so-called two-body relaxation time; the last expression holds in the limit of large v,. 


4 Remember Richard Feynman’s wise advice: “The same equations have same solutions!” 
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A natural question arises: Why have we defined the relaxation time by using the perpen- 
dicular component and not the parallel component, as suggested by Eq. (4.27)? A simple 
answer is provided by Exercise 7.2, where we evaluate the “exact” formula for D(|| Av, \|7), 
and we show that for large A 


D(\|Aviy ll?) ~ 4 G2neme (vy), (7.20) 


in other words, for large A, the parallel heating term is negligible compared to the 
perpendicular one, so that Eq. (7.19) is asymptotically correct also when one would add 
D(AE;) to the denominator. Why parallel heating is less important than perpendicular 
heating is simply a manifestation of the geometric fact that in a right triangle, when a 
cathetus is vanishing, the difference between the hypothenuse and the long cathetus is 
an infinitesimal quantity of order 2 (think of the expansion of the functions sin and cos 
for a vanishing angle), combined with the fact that relaxation is dominated (for large A) 
by distant encounters; we will return to this point in the next chapter. Finally, notice 
that fo) is (qualitatively!) inversely proportional to the DF of the field stars (see also 
Footnote 2). 


7.3 Relaxation Time for Self-Gravitating Systems 


In order to familiarize ourselves with the order of magnitude of fap in real astrophysical 
systems, we now proceed with a “back-of-the-envelope” estimate of Eq. (7.19) in the case 
of self-gravitating systems. Here, the student should focus on the physics of the prob- 
lem, without looking for rigorous algebra; the obtained results fully justify the qualitative 
approach followed, and more careful/sophisticated/realistic (and heavy) calculations would 
produce only correction factors of the order of unity, not changing the logical conclusions 
while obscuring the physical arguments. 

For the estimate of the various quantities entering Eq. (7.19), we consider a finite system, 
as in Exercise 1.1, of spherical shape and radius R, homogeneously filled with N identical 
stars of mass m = mg = m, SO that n¢ = 3N /(42 R*) (the famous “spherical cow!’’). 
Arbitrarily setting ry = R in the virial theorem as given in Eq. (6.26), we then have 
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and we adopt oy as the fiducial value for the characteristic stellar velocities in the system 
(i.e., Oy = Ve = Y;); for simplicity (but quite inconsistently!) we consider the high-velocity 
limit Y(v,) ~ 1/v;. We then express fp in the natural time unit of the system, its fiducial 
crossing time 
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so that simple algebra on Eq. (7.19) finally shows that at the leading order for large N 
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having set bmax = 2R and used Eqs. (4.26) and (7.21) to estimate the Coulomb logarithm. 

As an astrophysical application, let us consider a typical elliptical galaxy where 
N ~ 10!! and teross X 2x 108 yrs: the characteristic two-body relaxation time is then 
estimated to be in the order of 10°* Gyrs. Therefore, galaxies are essentially collisionless 
systems over cosmological timescales. In contrast, for globular clusters, where N ~ 10°*® 
and ftoross & 10° yrs, we obtain fap © 5 x ig yrs; for such systems, the cumulative effects 
of two-body encounters are then expected to be important on timescales shorter than 
their age, so these systems are marginally collisional, and this conclusion is confirmed by 
observations and numerical simulations (e.g., see Spitzer 1987). Finally, in open clusters, 
with N ~ 10273 and feross & 10° yIs, top © 107 yrs, an age shorter than the permanence 
time of the main sequence of massive stars, and this provides the explanation for why 
in general observed open clusters are made of young and blue stars (e.g., Binney and 
Merrifield 1998). 

Before concluding, we comment on a seamingly paradoxical conclusion that can be 
obtained from Eq. (7.23). In fact, we deduce that increasing the number of particles at a 
jixed total mass of the system (i.e., at fixed fcross), the effects of granularity of true appear 
to become important on longer timescales (i.e., fap increases!), and this, at a superficial 
reading, seems to be at odds with the fact that the test star experiences more collisions. 
However, the student should consider that two-body relaxation is not only due to the 
number of encounters, but also to the energy variations due to the encounters: 


AE, « N x energy exchange in one encounter. (7.24) 


However, the fixed total mass from Eq. (7.12) shows that the energy exchange in one 
encounter decreases as me o« N~, so that AE, « 1/N, leading to the increase of the 
numerator in Eq. (7.23). 

Why is the knowledge of fo) important in the astrophysical context? First, this time 
provides a nice indicator of what techniques should be used to study the system itself. 
Second, and more importantly, we know from statistical mechanics (e.g., Khinchin 1949) 
that every time a system of particles is subjected to the prolonged phenomenon of weak 
and statistically uncorrelated energy exchanges (fluctuations), the system is driven toward 
the establishement of a thermodynamical state, characterized by the universal Maxwell— 
Boltzmann distribution in Eq. (7.30) and by equipartition. This fact is of extreme impor- 
tance in astronomical systems that have a finite escape velocity (see Exercise 10.22 for 
a useful estimate). The final effect is to have gravitational evaporation of the low-mass 
stars and core collapse of the system (e.g., see Binney and Tremaine 2008; Spitzer 1987; 
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see also Chapter 6 for energetic considerations on the escape of particles from a system 
with total negative energy). 

The last comment concerns the observational fact that astrophysical systems such as 
galaxies and globular clusters are very inhomogeneous (i.e, the densities of their central 
regions are many orders of magnitude larger than their mean density); therefore, in detailed 
calculations it should be taken into account that f2p depends strongly on its position inside 
such systems. 


Exercises 


7.1 Show that the result of the integration over the impact parameter of Eq. (7.9) for the 
quantity D(\| Av, ||?) obtained from the first identify in Eq. (4.33) can be written in 
terms of the quantities 


4 
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where In A is the Coulomb logarithm, M = m, + mg, bmax is the fiducial maximum 
impact parameter (usually taken from the order of the size of the system), and v = 
||v(—oo)|| is the modulus of the initial relative velocity. Notice that the limit of large 
A (the standard case) refers to large values of bmax and nonzero values of v, while 
the (perculiar) case of small A refers to vanishing relative velocities. Then show that 
at the leading order for large A, 


D(\Avi |?) ~ 82 G?ngM7W (v;) In A, (7.26) 


where In A is the velocity-averaged Coulomb logarithm> defined in Eg. (7.15). 
Finally, verify that the same result is obtained by integration over b of the impul- 
sive approximated formula in Eq. (7.13), artificially truncating the lower limit of 
integration at bz/2. 

7.2 Show that the result of the integration over the impact parameter of Eq. (7.9) for the 
quantity D(|| Av ||?) obtained from the second identity in Eq. (4.33) can be written 
in terms of the quantity 
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5 Of course, nothing would prevent us from defining the velocity average of the full function in Eq. (7.25)! 


Exercises 135 


Then show that at the leading order for large A, 
D(\|Avy||?) ~ 42 G2ngM7 0 (v,). (7.28) 
7.3. Compute the velocity potential Y in Eq. (7.16) for a spatially homogeneous, 
Gaussian® velocity distribution 
e- U7 / 202) 
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g(vp) = 


in Eq. (7.7), and show that by setting 4, = ||v;|| and )} = v/(/200), 
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Hints: By using Eq. (7.17), first prove that 
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where Erf(x) is the standard error function in Eq. (A.57); verify that & (oo) = | and 
obtain the leading asymptotic term when y%, — 0 and y%, > oo. 

7.4 With this exercise, we consider the quite artificial but conceptually important case 
of a system with the velocities of the field particles spanning a very small range of 
values. Let (see e.g. Ostriker and Davidsen 1968) 


db (vg — Oo) 
g(vp) = ———, ve = |lvell, (7.33) 
Ar v5 


which is the velocity distribution, where o, is a given velocity (i.e., the distribution 
is an infinitesimally thin shell of “radius” o, in velocity space). First, compute W (v;) 
as in Eq. (7.16) and discuss the resulting expressions corresponding to Eqs. (7.26) 
and (7.28): What happens for y%, >> o. and u%. <« o.? Then consider a fixed value of 
bmax and evaluate Eqs. (7.14) and (7.20) in closed form by using the full expressions 
in Eqs. (7.25) and (7.27). What happens for 4%. — o> (i.e., for a test particle velocity 
modulus equal to the modulus of the velocity of field particles)? Do we have relax- 
ation? What about the relative importance of D(||Av;, \|7) and D(|| AY | \|7)? Hints: 
In order to compute the integrals over vr, use spherical coordinates in velocity space 
with the polar axis directed along v,. 


6 By considering oe = kgT/m, where kg = 1.38 x 10—16 erg/K is the Boltzmann’s constant, 7 is the temperature, and m is the 
mass of the particles, Eq. (7.30) is the Maxwell—Boltzmann velocity distribution, one of the most important objects of physics 
(e.g., see Born 1969). The associated (normalized) differential distribution in terms of the modulus of the velocity is 
dn= Arg (vp)uzdoe, so that its maximum, average, and root-mean-squared velocities, respectively, are given by 


= 8 —— =, 
Umax = V200, T= 4/—o0, Vu2 = V3o0. (7.29) 
w 
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7.5 Rosenbluth et al. (1957) introduced two potential functions in velocity space. The 
first is given in Eq. (7.16) and the second is 


xv) = / _ 8(vo)ll vi — villa? vr. (7.34) 
Di 


This occurs when constructing the diffusion coefficients in So with respect to vy. 
Show that for an isotropic velocity distribution x (v;) is a function’ of the velocity 
norm of the test stars 14, and that from Eqs. (2.126) and (2.127) with a = 0 (or by 
integration in spherical coordinates placing y; along the z-axis of the velocity space), 
this can be written as 


An ? Qa 2 = 2,2 
xX (Uy) = oo / g (vp) ug (3up + uPdoe + f g(up)upyy(3up + vp )dug}. (7.35) 
t 0 Ut 


Finally, calculate x(v,) for the velocity distribution in Eq. (7.33) and for the 
Maxwell—Boltzmann distribution in Eq. (7.30) and show that 


Ut 


J20 


Sie 
Xv) = V20. a (« + =x) Erf(y) |, Ur = (7.36) 


Ja 2 


7 Mathematically, x is a Riesz potential, similarly to those in Exercise 2.35. Physically, it is the analog in velocity space of the 
Roberts (1962) “superpotential” appearing in the evaluation of the self-gravitational energy of ellipsoidal systems, which will 
be encountered in Chapter 10. 
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Relaxation 2 


Dynamical Friction 


Dynamical friction is a very interesting physical phenomenon with important applications 
in astrophysics. At the simplest level, it can be described as the slowing down (“cooling”) 
of a test particle moving in a sea of field particles due to the cumulative effect of long- 
range interactions (no geometric collisions are considered). Several approaches have been 
devised to understand the underlying physics (which is intriguing, as the final result is an 
irreversible process produced by time-reversible dynamics; e.g., see Bertin 2014; Binney 
and Tremaine 2008; Chandrasekhar 1942; Ogorodnikov 1965; Shu 1999; Spitzer 1987). In 
this chapter, the dynamical friction time is derived in the Chandrasekhar (1943a) approach 
by using the impulsive approximation discussed in Chapter 7. 


8.1 The Chandrasekhar Formula 


We begin with a short review of the most important logical steps used in the derivation of 

dynamical friction in the classical case. From Eqs. (4.13)—(4.16), we have the exact identity 

(—00) 
v 


LL v 
Avy = —Avy), Avy = —l|Avyll : (8.1) 
mt 


where, as usual, v = ||v(—oo)|| is the modulus of the initial relative velocity and || Av\|| 
is given in Eq. (4.33). In Exercise 8.2, we will compute D(Av,) from Eq. (8.1) by using 
the exact expression of || Av) ||. However, in the spirit of Chapter 7, here we discuss the 
problem by using impulsive approximation, and from Eq. (8.31) 


——_v, M=m+my. (8.2) 


Note an important difference between Eq. (8.2) and Eq. (7.12): while in the case of the 
perpendicular velocity component the mass of the test star does not appear, here we have 
the product m¢(m, + mr), and this fact is rich in astrophysical consequences. Also note 
that the dependence of ||Avj)|| as the inverse of the cube of the initial relative velocity 
is asymptotically correct only in the impulsive approximation: for slow or grazing orbits, 
the functional dependence of || Ay;j || on vu is different. However, as in gravitational plasmas 
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there is no screening effect, it can be proved that the main contribution to dynamical friction 
comes from distant interactions (e.g., Spitzer 1987), so that the expression in Eq. (8.2) is 
the leading term. 

The sum over the number of encounters is performed following Chapter 7. Therefore, 
the diffusion coefficient of velocity of the test star, parallel to the initial relative velocity, is 
given by an expression that is analogous to Eq. (7.13) as 


v °° db 
D(Avy)) = —4 G?ngmeM BC BU) vg dvr if = (8.3) 
ws 0 &b 
where again the appearance of the ultraviolet and infrared divergence of the integral over 
the impact parameter is apparent. We repeat the same discussion as in Chapter 7, and we 
integrate between b,/2 and bmax, and again in analogy with Eq. (7.14) 


ese 


D(Avy) = —4 G?ngm¢M i: g(vp) nA d? ve. (8.4) 


| TE 


We now integrate over the velocity space. As in Chapter 7, we introduce again a velocity- 
weighted Coulomb logarithm! In A, and therefore Newton’s theorem on spherical shells 
(here applied to velocity space given the assumed isotropy of the velocity distribution of 
field particles) leads to the wonderful identity 


[. In A g(v) St yp = nd Oy, (8.5) 
m8 Ilvi — vell UF 


where &(v;) is defined in Eq. (7.17). Therefore, given the hypothesis of a constant value 
of In A, the velocity-dependent factor in Eq. (8.5) would simply be the gradient (with 
respect to the velocity v;) of the first Rosenbluth potential YV. The cumulative effect of 
the encounters is to slow down the test particle in the direction of the test particle velocity 
itself. This is not trivial, as according to Eq. (8.1) the deceleration in each single encounter 
is parallel to the relative velocity, and not to y,! The dynamical friction deceleration on 
a test mass m, moving with velocity v; in a homogeneous and isotropic distribution? of 
identical field particles of mass m¢ and number density n¢ is 


2 , Et) _ avy _ 
D(Avy) = —42 G*ngme(m, + me) In A zt = at’ Ut = |lvil |. (8.6) 
v 


f 


An important comment is in order here. According to Eqs. (8.5) and (8.6), only field parti- 
cles slower than the test particle contribute to its deceleration. This sharp “cut” in velocity 
space derives from two different assumptions, namely: (1) that the velocity distribution of 
field particles is isotropic; and that (2) that we can take the Coulomb logarithm outside the 
integral in Eq. (8.4). The latter hypothesis would be rigorously true only if In A is inde- 
pendent of v, but this — strictly speaking — is false because b,/2 depends on v. In fact, the 


| Note that this average value is not — strictly speaking — that which appears in Eq. (7.15). 

2 The following simple but important conceptual question sometimes arises in discussions with students: Why in a homogeneous 
and isotropic distribution of field mass DAM, ) = 0 from Chapter 7 is D(Ayj})) not zero? The fact is that in the reference 
system moving with mt, the velocity distribution of field particles is not isotropic in the direction of vt! 
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(small) correcting terms due to the dependence of In A on v can be explicitly evaluated (e.g., 
see Chandrasekhar 1941, 1942, 1943a; Merritt 2013; White 1949, see also Exercise 8.4). 

In analogy with Eq. (7.19), we are led to define the dynamical friction time in a natural 
way as 


3 
_ Iv(—oo) | v; 


df = 7 
|D(Avi) |] 42 G2ngme(me + mm) In A Ey) 


(8.7) 


Notice that in the numerator in Eq. (8.6) and in the denominator of Eq. (8.7), we can 
recognize the quantity nsms = /¢ (i.e., the mass of the field particles actually enters into 
the dynamical friction as the mass density, and so for m; >> mg the actual mass of the field 
particles does not appears). From this point of view, Eq. (8.7) can be used (for example) to 
study the effects of dynamical friction produced by a dark matter halo in a galaxy. 

We now use all of the results obtained so far in Chapter 7 and the present chapter and 
we evaluate the diffusion coefficient for AF; given in Eq. (4.27) 


D(AE,) = D(AE,,) + D(AEy) + m(D(Avi), Vi(—00)), (8.8) 


where the last expression derives from the fact that in our framework of a homegeneous and 
isotropic field distribution, D(Av;, ) = 0. Moreover, in the usual circumstance of a large 
A and ignoring the subdominant contribution of Eq. (7.20), from Eqs. (7.18) and (8.6), we 
obtain the important estimate 


1 D(AE I t 
2 (1 5 2), (8.9) 
tak, Ey 12b lat 


clearly showing how the relative importance of the two-body relaxation time and of the 
dynamical friction time determines whether the considered test star is heated or cooled. 


8.2. Dynamical Friction in the Presence of a Mass Spectrum 


As we have seen in the previous section, in the classical approach to dynamical friction all 
of the field particles have the same mass and their distribution is uniform in configuration 
space and isotropic in the velocity space. However, there are astrophysical situations in 
which a mass spectrum can have relevant effects, namely when the test particle (even 
though very massive) travels with a velocity comparable to the velocity dispersion of field 
particles, or when its mass is of the same order of magnitude as the average mass of the 
field masses. A specific example is represented by the population of blue straggler stars 
(BSSs) in globular clusters (e.g., see Ferraro et al. 2012, 2018 and references therein). In 
fact, BSSs are believed to originate by the merging or mass accretion of otherwise normal 
stars, so that their mass is at most a factor or so larger than the average mass of the stars 
in the parent cluster, and their mean velocities are similar to those of normal field stars. 
In addition, the stars of the globular clusters are characterized by a mass spectrum, and 
finally, globular clusters are collisional systems with relaxation and dynamical friction 
times comparable to their age. Other cases of test particles (with much larger masses) 
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moving with a velocity similar to that of field particles is represented by binary black holes 
in galactic nuclei (e.g., Merritt 2013 and references therein). These examples indicate that 
studying dynamical friction in a field particle distribution with a mass spectrum is important 
(e.g., see Ciotti 2010). 

Given the previous preparatory work, it is now easy to generalize the classical dynamical 
friction formula in Eq. (8.6), derived under the assumption that all of the field particles have 
the same mass, to the case of a mass spectrum of field particles. A generic mass spectrum 
with an isotropic velocity distribution is described, by extension of the classical treatment, 
with an extended phase-space distribution function 


éf = mN (me) g(vp,ms), f= / of dme, (8.10) 


where A (me) is the number density per unit volume and unit mass of field stars with mass 
in the range between mg and ms + dmg. The total number density of field stars, the average 
field star mass, and the mass ratio of the test star with respect to the average field star mass 
are given by 

t 


gee i Nimpdms, (ms) = ~{- NG@wondms, HRS. (8.11) 


(me) 
Of course, the normalization of the velocity distribution for each mass component is 
expressed by the fact that 


Ut 
B(v;y, me) = ax [ g(ue,me)uedue, H(co,me) = 1, VWme. (8.12) 
0 


In the presence of a mass spectrum of field stars, the differential number of encounters 
suffered by the test star is 
Anenc = 2mbdb x ||v; — ve||At x N (mp) g (ur, mp)dmed ve. (8.13) 
Therefore, by summing the formula obtained in the classical treatment over all of the 
species, the deceleration in the mass spectrum case is given by 


D(Avy) = —An G? (In A)g <f N(me)mgM E(uy,me) dmg 


= eee (m,)) (In A) 33 * (1) ~ (8.14) 


= 
Uv 


where (In A)s is the natural average of In A implicitly defined by the first line in 
Eq. (8.14).? In the last identity in Eq. (8.14), in contrast with the classical formula (8.6), 
the integration over the mass spectrum leads to the introduction of the function 


3 Inall practical applications, it can be safely assumed that (In A)g = In A. 
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Jo N(ms)meM E(v,,ms) dmg RH, (v4) + Ho (wr) 


=a* (uy) = = : 8.15) 
ng(me)(my + (mi) R+1 
where we recall that R = m;,/(me) and (prove it!) 
1 [o,@) 
H, (v4) = / N (me)me 2 (v4, mp) dme, 
ng(me) Jo 
(8.16) 


1 [o.2) 
Ho(v,) = alma? [ N (mpm? E(vt,me) dmg. 


In practice, from Eq. (8.16) one can derive the dynamical friction deceleration for the case 
of a mass spectrum by using the same formalism as the classical case, where me is replaced 
by (mg). It is important to note that for large velocities of the test star E(u4,mr¢) — 1, and 
formally 


oe : 
N (mp); dmg, E* (00) = R + Ho(0o) (8.17) 


Hj (oo) = 1, Ha(o0) = f “gins? R+1 


In order to illustrate the effects of a mass spectrum, in the following sections we present 
three explicit cases of mass spectra amenable to analytic solutions so that the differences 
from the equivalent classical cases can be quantified. In order to compare dynamical friction 
in the presence of a mass spectrum with the classical case (i.e., the case with identical field 
particles), we must carefully define the concept of the equivalent classical system. We 
will say that a classical case is equivalent to a mass spectrum case when: (1) the number 
density of field particles in the classical case is the same as the total number density in the 
mass spectrum case; (2) the field mass mz in the classical case is the same as the average 
field mass (mf); and (3) the velocity dispersion of the (Maxwellian) velocity distribution in 
the classical case is the same as the equipartition velocity dispersion of (mf) in the mass 
spectrum case, i.e., 


me 5 of a en / Zon) — g-vpr/(202),3/2 aie 
r=—, oO,=—, glume) = = , . 
(mf) née On Pra, (27)3/203 
so that from Eq. (7.32) 
~ 2 
2v, =U F 
Limite = go" (8.19) 


Vu V20. 


We can summarize these conditions by saying that the comparison is between two systems 
with the same number, mass, and kinetic energy density of field particles. Of course, 
alternative configurations of astrophysical relevance can be easily imagined (e.g., see Ciotti 
2010); in particular, one might be interested in the study of a mass spectrum when all 
of the field stars have the same velocity distribution independently of their mass (e.g., a 
collisionless system made of stars and dark matter). 
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8.2.1 Exponential Mass Spectrum 


In this first example, we assume that the number distribution of field stars is described by a 
pure exponential law: in this case, from Eq. (8.11) the mass spectrum can be written in full 
generality (prove it!) as 


ea m/ (ms) 
N (mp) = ng———-.. (8.20) 
(mp) 

In the equipartition case, the integral over masses in Eq. (8.15) can be performed analyti- 
cally by inserting Eq. (8.12) into Eq. (8.16), inverting the order of integration between v¢ 

and me, and finally obtaining the surprisingly simple result 

53 (5 + 23,7) O35 + 285," + 80,4) 

Hi = ——,—, Wh= 5 : (8.21) 
2(1 + o,°)°/7 41+ 0,°)7/7 


The behavior of the two functions in the limit of low- and high-velocity 4%, can be 
easily determined, and in particular Eqs. (8.15) and (8.21) prove that the result coincides 
asymptotically with the classical case for fast (H; ~ | and H2 ~ 2) and massive (R > 1) 
test particles. In general, as can be seen from Figure 8.1, the velocity factor in the case 
of an exponential mass spectrum with equipartition is larger than in the corresponding 
classical case (heavy line in Figure 8.1); for massive test particles, the maximum drag 
(corresponding to y%. ~ 0.810.) is a factor ~2 higher than in the equivalent classical case 
and fg¢ is correspondingly shorter. 


8.2.2 Discrete Mass Spectrum 


In this second experiment, we consider a system made of two species of field stars (the 
generalization to an arbitrary number of different field components presents no difficulties) 
with a mass spectrum given by 


N (mg) = 115(m¢ — my) + 24(mg — m2), (8.22) 


where 6 is the Dirac 6-function. With the convenient introduction of the dimensionless 
parameters x = n2/n, and y = m2/my, it follows that 


nim; +nom2 — 1+xy 


ne=(1+x)n1, (mp) = earns ice 


m1, (8.23) 


and the analogous identities expressed in terms of nz and m2 are immediately obtained 

from the definition of x and y; of course, the value y = | corresponds to the classical case. 

From Eq. (8.16), we now have 
E(vy,m1) + xyS(v,,m2) 


H; = , H 
1 Ex 2 


— A+x)[E(,m1) + xy? E(vj,m2)] 


(i+ xy? oe 
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Figure 8.1 The velocity coefficient &*/ i? in Eq. (8.15) for the exponential mass spectrum case of 
field particles with equipartition: 0; = v,/ (205). The curves (from top to bottom) correspond to a 
test particle with mass 0.1, 1, and 10 times the average mass of the spectrum. The heavy solid line 
represents the velocity coefficient of the equivalent classical case (i.e., when the field masses are all 
identical and their number density, average mass, and kinetic energy density are the same as in the 


mass spectrum case). 


For low velocities of the test particle, one finds (do it!) the asymptotic trends 


. 453 (1 + x)3/21 + xy>/?) 4631 + x)9/21 + xy7/?) 
pee oe 


3J/a(l+xy/2 7 3,/m(1 + xy)? 
In turn, for large velocities of the test particle, the leading terms are 


(1+x)(1+ xy?) 


Hi(0o) = 1, H2(00) = Gay? 


(8.25) 


(8.26) 
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Therefore, for fast and massive m,, the dynamical friction force in the presence of equipar- 
tition is the same as in the equivalent classical case. In the nonequipartition case, the 
correcting factor for the classical dynamical friction formula is obtained by evaluating 
Eq. (8.15) with the expansions given in Eq. (8.26). 

We now study the case of arbitrary mass ratios and velocities. For simplicity, we restrict 
the following analysis to the special case of a system in which the densities of species 1 
and 2 are the same (i.e., 21m, = n2mz2 and so xy = 1). Therefore, for x > 1, the masses 
mz are lighter and more numerous than for species 1; it is easy to recognize that the cases 
x > land x < 1 (with reciprocal values) coincide. The qualitative trend is the same as in 
the exponential case: the equivalent classical case always underestimates the true value of 
dynamical friction, with the largest deviations (at fixed R) for test particle velocities being 
comparable to the field equipartition velocity dispersion. The discrepancies can be as large 
as a factor of 6-10 for masses of the test particle in the same order of magnitude as the 
average mass of the spectrum. 


8.2.3 Power-Law Mass Spectrum 


As is commonly done in many cases of astrophysical interest, we finally assume a power- 
law spectrum peaked at low masses, with a minimum mass mj, a finite average mass 
(m-), and exponent a > lI, i.e., 


am? am; 


N (ms) = ng—, (mp) = ———,_ mp > mj. (8.27) 


mira’ a-1l 


For simplicity, no truncation at large masses is imposed. Mass integration in Eq. (8.15) can 
be done analytically as in the exponential case of Section 8.2.1 by inverting the order of 
integration between ve and me, and after some algebra* one obtains 


(2a — 3)./c0,Eg—1/2(cd,") 


Hi (0) = Erf(irv/c) We 


a>l, 


(a—1) (2a — 5) VciEa—32(ctt*) se 
" a— 7 a— CUt~Eg—3/2(CUt 
H(i) = ——— | Ef) oe , a>2, 
a(a — 2) Jit 
4 The two equations are obtained as special cases of the general identity 
; I+) Wl+aq (of 
z Y 24) + 2° TS Ey (2) 
[ x By (x!) dx ( as ) (8.28) 
0 1+a4+nuo—b 


where A > max{—1, — 1 — p(v — 1)}, fixing A = uw = 2, v = a — 3/2, v = a — 5/2, and using the recursion relations in 
Appendix A (see Ciotti 2020). 
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where c = | — I/a, the exponential integral function is given in Eq. (A.59), and finally 
a > 2 is required for the existence of solutions. From Eq. (8.17), it is easy to show that 
(a —1)° 
Hi(oo) = 1, Ho2(oo) = ———~ = 1, (8.30) 
a(a — 2) 
and this demonstrates again that for high-velocity 4% and large mass ratio ® = m;/(me) of 
the test particle the classical result is recovered. As a nice exercise, the student may wish to 
obtain the behavior of the functions in Eq. (8.29) for vy, — 0 and study the corresponding 
behavior of &*/ 0,7 as a function of a > 2. 

Summarizing the results of the simple experiments described in the last three sections, 
we learned that for fast and massive test particles the results in the classical and mass spec- 
trum cases become identical, because under these assumptions &* — 1 from Eq. (8.17) 
because for R >> 1| the specific form of the mass spectrum becomes irrelevant, as all 
of the field particles can be considered to be vanishingly small, and so Eqs. (8.6)-(8.14) 
coincide for m, = (m¢). Moreover, in the cases considered, the dynamical friction force in 
the mass spectrum case is larger than in the corresponding classical case and the dynamical 
friction times are correspondingly reduced. Notice that these results can also be extended 
to the case in which the mass spectrum particles are not at equipartition but the species 
are characterized by the same velocity dispersion (e.g., stars and dark matter particles in a 
common potential well). 


8.3 Astrophysical Applications and Final Comments 


As the student can easily realize, the astrophysical applications of dynamical friction in the 
study of the dynamical evolution of astronomical systems are counteless (e.g., see Bertin 
2014; Binney and Tremaine 2008; Spitzer 1987), ranging from the dynamical evolution 
of binary black holes in galactic nuclei (e.g., see Merritt 2013 and references therein), 
to galaxy merging and the formation of type-cD galaxies (e.g., see Tremaine et al. 1975; 
White 1976; see also Barnes and Hernquist 1992; El-Zant et al. 2004; Nipoti et al. 2004 
and references therein), to the shaping of the radial distribution of BSSs in globular clusters 
(e.g., Alessandrini et al. 2014 and references therein), to the sinking of globular clusters or 
satellites within their host galaxy (e.g., see Arena and Bertin 2007; Arena et al. 2006; Bertin 
et al. 2003; Bontekoe and van Albada 1987; Gnedin et al. 2014 and references therein), to 
modified Newtonian dynamics (Nipoti et al. 2008). We finally recall that, on the theoretical 
side, more powerful (and physically deeper) tools have been developed, allowing for the 
treatment of dynamical friction in more realistic cases than that of the infinite and homege- 
neous sea of field particles described in this chapter (which wever has the great merit of the 
physical simplicity, however!), such as the extension to anisotropic velocity distributions 
of field particles (e.g., Binney 1977) or interpretation in terms of “density wake” (e.g., see 
Binney and Tremaine 2008; Tremaine and Weinberg 1984 and references therein). 
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Exercises 


With this exercise, we estimate the asymptotic value of || Avi || for b — oo by using 
impulsive approximation. Consider the exact identity in Eq. (4.13), and evaluate it 
for vanishingly small values of || Av, || in Eq. (7.11), imposing the second identity in 
Eq. (8.1). Show that for b — oo energy conservation requires, at the lowest order, 


|Avill? 267M 

2v2 b2v3 ? 
where M = m+ me. Reobtain the same expression by direct expansion of Eq. (4.33) 
in the limit of large b. 


| Avy | ~ (8.31) 


Show that the result of the integration over the impact parameter of Eq. (7.9) for the 
quantity D(Av,) obtained from Eq. (8.1) and the second identity in Eq. (4.33) can 
be written in terms of the function 


2 
br, Drax , A>d, 
A= 2 = ~ Infiee2ael 2 : (8.32) 
m2 InA, A>o. 


Then, show that at the leading order for large A = Dmax/bz/2, 


E(u) 


ve 


so that Eq. (8.6) holds (see also Exercises 7.1 and 7.2). 
Compare the two-body relaxation time and the dynamical friction time and show that 
in the limit of large A 


D(Av)) ~ —4G?ngM? In A Vt, (8.33) 


taf ee ue (uy) = 2me 2m¢ (8.34) 


toy = E(u) met me — my + mg 
where the last identity holds for very large vy; due to the normalization condition on 
= (v;) and to the “monopole” behavior of Y(v;,). Then, use Eq. (7.23) and obtain the 
analogous expression for fa¢/tcross for the mass m; traveling in the virialized, self- 
gravitating system adopted in Section 7.3. 
Consider again the artificial system in Eq. (7.33) and evaluate again the dynamical 
friction formula Eq. (8.33): What happens for y%, >> o. and uy < o.? Then, fol- 
lowing Exercise 7.4, obtain the closed-form expression of Eq. (8.4) by using the full 
expression in Eq. (8.32). Discuss the case uy > oo. 
Generalize the 2-body relaxation time in Eq. (7.19) to the case of field stars with a 
mass spectrum. First derive the expression analogous to Eq. (8.14), 


CO 
D(|Av,, |2) = 8G2(In A)y i NGndme tiem ane 
0 


= 82 G*n¢(mz)* (In A) w Y* (vy), (8.35) 


8.6 


8.7 


Exercises 147 


where (In A)y is the natural average of In A over W implicitly defined by the first 
line in Eq. (8.35) (see also Footnote 2), and 


W*(v,) = = / N (me)? (v;,me) dmg ~ ne (8.36) 
ng (mg)? 0 Ut 


where the last expression is obtained from Eqs. (7.16) and (8.17) in the limit of very 
high velocities of the test star. Then show that 
tar W(x) 2(ms) 2H (00) 
toh = E*(u) m+ (me) RR + Hp (00)’ 
where R = m,/(me), the first expression is obtained for large values of A, and the 
last expression is obtained under the additional hypothesis of very large vy. 
With this exercise, we estimate how the Coulomb logarithm depends on m; in a 
virialized stellar system made of stars of mass mp = m, and of dark matter par- 
ticles of mass ms = mpm. Consider a two-component system as in Section 7.3, 
with M, = N.m,, Mpm = RM,, and radius R. By fixing all of the shape factors to 
unity, show that the Coulomb logarithms of m; against stars and DM can be written, 
respectively, as 


(8.37) 


2N,1 +R 2N,0 +R 
Oy ee ca mana awe 
r+l r +TrpM 
where r = m,/m, and rpm = ™pm/m.. Hints: Assume bmax = 2R and use the 


estimate 


; (8.38) 


GM,(1+R 
Pog = SC +®) (8.39) 


to obtain bz /2 as in Eq. (7.23). What is the expression for In Apy in a system made 
of dark matter only? 

Following Exercise 8.6, estimate how f2p and tg¢ depend on dark matter in a two- 
component stellar system by using the same approach as in Section 7.3 (see also 
Exercise 8.3). By using Eqs. (7.18), (7.19), (8.6), and (8.7), considering the limit of 
large vu; and then setting (quite inconsistently) vy = oy, show that 


1b Nz. + Ry 
tcross 12(n A, + Rrpm In Apm) , 


VU, 


(8.40) 


tae Ns +R) 
tcross 6[(r + 1) In Ay + Rr + rpm) In Ap] 
respectively, where foross = 2R/Y;, & 1/1 + FR from Eq. (8.39).What happens in 


a system made only of dark matter for rp) — 0? Evaluate tg¢/t2,) and compare the 
result with Eqs. (8.34) and (8.37). 


(8.41) 


Part III 


Collisionless Systems 


9 


The Collisionless Boltzmann Equation 
and the Jeans Theorem 


With this chapter, the final part of the book, dedicated to collisionless stellar systems, 
begins. As should be clear, in order to extract information from the N-body problem, we 
need to move to a different approach from direct integration of the differential equations 
of motion, and a first (unfruitful) attempt here will be based on the Liouville equation. In 
fact, the basic reason for the “failure” of the Liouville approach is that, despite its apparent 
statistical nature, the dimensionality of the phase space! [ where the function f%) is 
defined is the same as that of the original N-body problem. Suppose instead we find a way 
to replace the 6N-dimensional 9° phase space I with the six-dimensional one-particle 
phase space y: we can reasonably expect that the problem would be simplified enormously, 
and in fact along these lines we will finally obtain the collisionless boltzmann equation 
(CBE), one of the conceptual pillars of stellar dynamics. 


9.1 The Liouville Equation 


As previously mentioned, the direct approach to the N-body problem (i.e., the search for 
exact solutions of Eq. (6.1)) is in general hopeless, and so alternative approaches must be 
found. A first sensible idea could be to restrict our focus to some specific property of the 
N-body system, with the hope of being able to deduce and solve the associated differential 
equations. A particularly important family of mathematical objects that is well suited for 
this approach is that of macroscopic functions. A clear definition of a macroscopic function 
is founded on two basic geometric properties of the phase space I’ and on the concepts 
of microstate and macrostate, borrowed from statistical mechanics (e.g., see Boltzmann 
1896; Khinchin 1949; Saslaw 1987). As is well known, at any time the dynamical state of an 
N-body system is completely determined by a point in the extended phase space, so that the 
dynamical evolution of the system can be imagined as a path in the extended phase space 
originating from its initial conditions: the curve is the solution of Eq. (6.1), represented in 
Figure 9.1 by the line connecting Po to P;. 


! In the following, the phase space of an N-body system gON 


space is then I" x St, where the time coordinate appears. 


will be indicated with the standard name I’. The extended phase 


151 


152 The Collisionless Boltzmann Equation and the Jeans Theorem 


7) Liouville 


Figure 9.1 Motion of a set (0) in the extended phase space of an N-body system: due to the 
solenoidal nature of the Hamiltonian flow, Q(0) transforms into Q(t), conserving its phase-space 
volume. Moreover, points of the frontier 0Q(0) remain on the frontier 0Q(f) (e.g., see Khinchin 
1949; Meyer 1982); due to the unicity of the solution (Appendix A.10.1; see also Wintner 1947), no 
points can exit from Q(f). 


Notice that from the theorem on the uniqueness of solutions (see Appendix A.10.1), 
two different curves in the extended phase space cannot intersect each other at any time. 
In the following, each point of IP x ¢ will be called a microstate, and a function & defined 
on I’ x i will be called a macrostate: very simple examples are the kinetic energy of the 
system, its angular momentum, the number of stars with velocities in some prescribed 
range of values, and so on. These examples should convince the student that, in general, 
a microstate defines unambiguously a macrostate, but the converse is not true: a given 
value for a prescribed macrostate determines in the extended phase space many microstates. 
Therefore, given an N-body system and its initial conditions, its trajectory in the extended 
phase space can also be interpreted as the set of all microstates of the system as a function 
of time; mathematically, the time evolution of an assigned macrostate & is given, according 
to Appendix A.10.2, by its associated Lagrangian function Er. 

From this point of view, the question we tried to formulate earlier now becomes: Sup- 
pose that at t = 0 an N-body system, its initial conditions, and a desired macrostate & are 
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assigned so that & (0) is completely determined. Can we predict the time evolution of E(t)? 
Is the related differential equation simpler than the equations of the full N-body problem? 
Note that due to the invariance of & with respect to some (or many) microstates, simply 
assigning &(Q) is equivalent to assigning a set of microstates at t = 0. In the following, 
the set of microstates that at t = 0 leave the macrostate &(0) will be called an ensemble 
Q2(0) for the macrostate E. From what we have said, each microstate € (2(0) evolves 
following Eq. (6.1) and the ensemble (2(0) transforms into Q(t) (see Figure 9.1). Note 
that & is by definition invariant over (2(0), but this in general cannot be expected to be 
true for (2(t)! This can be easily understood by considering that the evolution of each 
“point” in I’ depends in general on all of the phase-space coordinates, and even if & is 
independent on some coordinate, these “invisible” coordinates can also drive the evolution 
of the “visible” ones. 

One possibility for proceeding further with this approach is to define the mean value 
for the macrostate & over (2(f); this mean value is what we call a macroscopic function. 
In order to define formally the concept of a macroscopic function and to determine its 
evolution equation, we introduce the Liouville equation (e.g., see Arnold 1978; Binney 
and Tremaine 2008; Chandrasekhar 1942; Ogorodnikov 1965; Saslaw 1987). Let f) be 
the characteristic function for the ensemble Q(t) (i.e., f% equals unity if the microstate 
€ Q(t), zero otherwise”). We now show that in phase space, from the Reynolds transport 
theorem in Appendix A.10.3 (and restricting for simplicity to Cartesian coordinates), the 
time evolution of f%) is given by the solution of the Liouville equation 


N 
+)> (0.9, ™) ayer, vn ™}) =( i05) 


DFS") ope™ 
7 i=l ia 


Dt ot 


where i is the characteristic function of (2(0), U is given in Eq. (6.2), and the meaning 
of the gradient operators is apparent. Equation (9.1) is obtained as follows: let Q(0) € T 
be an arbitrary ensemble. The number of microstates inside Q(t) cannot change over the 
course of the evolution because, for the uniqueness of solutions, microstates cannot cross 
the boundary 0&2 (t), so that the identity 


d 


ae far =0, V2(0), (9.2) 
dt Q(t) 


holds at all times, where with dl’ we indicate the phase-space volume element. From the 
transport theorem and from the fact that the vector field associated with U is solenoidal in 
I, being Hamiltonian, Eq. (9.1) follows. Notice that if we set f ©’) = 1 in Eq. (9.2) and we 
again use the transport theorem, we prove that the phase-space volume of (2 (f) is conserved 
(i.e., we prove the Liouville theorem inl). 


2 Note that from its definition fON) is a nowhere negative function over the extended phase space. 


154 The Collisionless Boltzmann Equation and the Jeans Theorem 


We are now in a position to formulate rigorously the concept of a macroscopic function. 
In fact, let & be a macrostate and (2(0) its ensemble at t = 0. At time ¢, the macroscopic 
function associated with & is given by 


6N) 2 
ie Jowd Ser 
a/f = . 

Sow) forrar 


(9.3) 


Therefore, (=); is known (at least in principle) provided that the time evolution of f ©? is 
known; from what we have said before, at tf = 0 we have (&)y = (0) by construction, 
but this is not generally true for t > 0, so we need f ©), 

The Liouville equation can be formally solved. In fact, Eq. (9.1) belongs to the class of 
quasi-linear partial differential equations (PDEs) that are defined as follows. Let x € A C 
Mt”: a first-order PDE 


n 


dy wn dy dy aj Oy 
a=. + a dj ax eas arr an D i OH, y(%; 0) = A(x) (9.4) 


is called a quasi-linear homogeneous PDE with initial condition /, and in general the coef- 
ficients are functions a; (x, y;f); moreover, we assume that ag ~ 0. When the coefficients a; 
fori = 0,1,..., do not depend on y, the PDE is said to be linear. Assume for a moment 
that the solution y is known. The A solution y = y(x;f) is a function y : RN” x Rr R, so 
that Eq. (9.4) is identically verified, and y(x;0) = h(x). The characteristic ¢ = ¢(xo; t) of 
Eq. (9.4) associated with xo is the solution of the following system of ODEs: 

dxj _ ailx, y(xo;t);¢] ey P 

dt — ag[x, y(xo;t);t]’ a (9.5) 

xi (0) = x0,i, 


where c(x9;0) = xo. The concept of charateristics is of fundamental importance in the 
theory of PDEs, because we now show that a function y is a solution of Eq. (9.4) if and 
only if it is constant in its characteristics, i.e., 


yle(xo; t);t] = h(xo), Vo. (9.6) 

Let us begin by showing that if y obeys Eq. (9.6), then it is a solution of Eq. (9.4). In fact, 

let xo be fixed and y be given by Eq. (9.6): then, from Eq. (9.5), along each characteristic ¢, 
n 


dy[e(Xo;t);t] dy dx; dy 
0= = : 9.7 
dt (Z+y dt ) ae 

= x=C 


and so Eq. (9.4) is verified. The proof that if y is a solution then it is constant along the 
characteristics is slightly less elementary, and it is based on a geometric interpretation of 
Eq. (9.4) as the inner product of the vector field of the coefficients a; and the gradient of the 
solution y, whose existence is assumed. Note that for a quasi-linear PDE the characteristics 
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depend on the solution itself, and this fact is at the basis of many remarkable properties of 
fluids such as shock waves (and of the difficulty of the subject!). However, for a linear PDE, 
the characteristics are independent of the solution y and can thus be computed (at least in 
principle!) without knowledge of the solution. We are finally in a position to discuss the 
characteristics of the Liouville equation. From Eqs. (9.1)—(9.5), it appears that the charac- 
teristics of the Liouville equation are the solutions of the ordinary differential equations 
(ODEs) describing the motion of each particle in the N-body system; therefore, we are 
forced to conclude that despite its statistical flavor, the Liouville approach is equivalent, 
in terms of mathematical difficulty, to the direct solution of the original problem. For our 
problem, we arrived at a sort of “dead end!” 


9.2 The Collisionless Boltzman Equation 


The lessons learned in the previous section force us to move from the quest for an exact 
theory to modeling, founded on additional (physically rigorous!) assumptions. This is done 
in the framework of the CBE that we now “deduce” from phenomenological arguments. 
The CBE, one of the most important mathematical tools of stellar dynamics, rigorously 
applies in the limit of perfectly collisionless stellar systems, and it represents the starting 
point for many of the topics that will be discussed in the following chapters of this book. As 
anticipated in Chapter 7, for a finite number N of particles (stars), the distinction between 
collisional and collisionless systems is not sharp, in the sense that the effects of gravitational 
interactions (the “collisions”) build up as time increases. For this reason, one might also 
wish to look for a simple modification of the CBE that is applicable to stellar systems 
in weakly collisional regimes. Such an equation, which we will briefly present later on, 
is the important Fokker—Planck equation. It aims to describe stellar systems where the 
cumulative effects of (weak) collisions cannot be neglected while still using the smoothed 
potential ¢. 

The first goal is to determine, in the perfect collisionless regime, the differential equation 
for the evolution of the smooth phase-space distribution function (DF) f introduced in 
Eq. (7.5). In Chapter 7, we obtained indications that, by increasing the number of particles 
in a system dominated by gravitational forces, the collisionless approximation (i.e., the 
substitution of the true discrete system with the continuum approximation) is better and 
better realized over longer and longer timescales. In the ideal limit of N — oo, we expect 
the collisionless approximation to be valid for any time. Our first goal is achieved with 
the CBE (also known as the Vlasov equation in plasma physics). We begin by considering 
Cartesian coordinates. Priority is given here to the illustration of the physical and mathe- 
matical ideas, without the additional difficulty of more advanced formulations; however, in 
the next section, we will consider the CBE in curvilinear (orthogonal) coordinates, the usual 
framework adopted in several applications of stellar dynamics. For the sake of generality 
and for future applications, let us assume that in addition to the potential ¢ associated 
with the smooth density ¢ as in eq. (2.4), an external smooth potential dext = Pext(X; t) is 
considered; in other words, each element of y moves under the action of the total potential 


156 The Collisionless Boltzmann Equation and the Jeans Theorem 


br = + dext (when dext = 0 the system is called self-gravitating) with the phase-space 
velocity field 


(x, V) = (v, — Vx ¢r). (9.8) 


The CBE dictates that in the collisionless regime f evolves according to 


Df _ of _ 
Dt = Or (v, Vx f) — (Vx br, Vv f) = 9, (9.9) 


where the meaning of the Cartesian operators Vx and Vy is obvious, and 


Ad (x; t) =4re [ fay, F(x, Vv; 0) = fo(x,v). (9.10) 
x3 


In fact, in the perfect collisionless regime, we can use the same mathematical arguments 
used in the derivation of the Liouville equation: for any arbitrary phase-space region Q(t) 
in y, the mass contained in QQ(f) at any time f¢ is given by Eq. (7.3) with Q = Q(t). Each 
point of &2(t) — and each point on its boundary 0&2(t) — moves according to the vector 
field determined by the gradient of the potential éy: from the uniqueness of the solutions 
of the ODEs, no orbits can leave this volume. In other words, the mass contained in Q(f) 
is conserved at all times and the vector field induced by ¢r7 is solenoidal, so that from the 
Reynolds transport theorem we finally obtain Eq. (9.9) (for more thorough discussions, see, 
e.g., Binney and Tremaine 2008; Chandrasekhar 1942; Ogorodnikov 1965; Saslaw 1987; 
Spitzer 1987; Woltjer 1967). 

Of course, as with the Liouville equation, the CBE solution can be formally obtained in 
terms of its characteristics, now given by the solutions of the system in Eq. (9.8) with initial 
conditions x(0) = xo and w(0) = vo; therefore, the characteristics associated with the CBE 
are curves in the extended one-particle phase space, and the problem is now enormously 
simplified with respect to that posed by the solution of the Liouville equation. Unfortu- 
nately, we do not yet have a complete mathematical understanding of the properties of orbits 
in three-dimensional time-dependent (or even time-independent!) potentials. Therefore, the 
problem posed by the general solution of Eq. (9.9) — despite its apparent simplicity — is still 
too difficult to be solved in general cases. 


9.2.1 The CBE in Curvilinear (Orthogonal) Coordinates 


Now, it is obvious that in “practical” applications of the CBE Cartesian coordinates are not 
the preferred choice, and so the student may legitimately ask how to rewrite the CBE in 
curvilinear coordinates. As would become clear to anyone bravely attempting to change 
coordinates directly in the CBE (or in general in PDEs), this is not an easy task. A first 
possibility (e.g., see Bertin 2014; Binney and Tremaine 2008; Woltjer 1967) is to use 
canonical coordinates and the transport theorem. In fact, let us consider Eq. (A.174), and 
suppose we change coordinates from Cartesian to another set of canonical coordinates 
(q, p): of course, from Eq. (7.3) the function 
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AM[Q(t)] = f (xvi t)d?xd?v = f'(a.p;t)J (q,p)d?qd*p, (9.11) 
Qt) Q'(t) 
where J is the determinant of the Jacobian? of the transformation, is independent of the 
coordinates (canonical or not!) adopted. Finally, in the last integral 


W’ = (VpH"', — VqH’) (9.12) 


is the velocity field in phase space produced by the new Hamiltonian function, under which 
the region Q’(t) moves.* Now, the use of the canonical coordinates leads to two important 
consequences: 


3 / / 

aw! aw’, 

apie, > (T+) <0 (9.13) 
i=l dqi Opi 


For a general proof of the first identity see Binney and Tremaine (2008), and for the 
special case of curvilinear orthogonal coordinates see Exercise 9.2. The latter is simply 
the manifestation of the solenoidal nature of Hamilton equations in phase space. Repeating 
the same line of reasoning adopted for the derivation of the CBE in Cartesian coordinates, 
we conclude that under the assumption of perfect noncollisionality AM is conserved for 
arbitrary &2(t), and again the transport theorem in phase space y for a DF f expressed in 
canonical coordinates leads to the formal rewriting of Eq. (9.9) as 


Df’ _ af ae on af") af’ 
= 7) 
i=l 


= ' H'}=0, 9.14 
Dt at Opi OGi = OGi::«OPi ne a 


ot 


where [f’, H’] is the Poisson bracket between* f’ and H’. This result is really beautiful, 
and it allows us to write the CBE in arbitrary canonical coordinates. As we did in the case 
of the Liouville equation, in the present case we can also fix f = f’ = 1 in Eq. (9.11) 
so that AM[Q2(t)] = Volume[{2(r)], and using the transport theorem again, we obtain the 
proof of the Liouville theorem in y. 


In a canonical coordinate transformation, the Jacobian matrix is symplectic (see Footnote 2 in Appendix A); therefore, its 
determinant evaluates to | and no absolute values is required in the integral. 

We recall that in general the quantities q = (q1,...,¢n). 4 = (91, ---,9n), and p = (pq,..., pn), which, for simplicity, we 
indicate with bold face letters as true vectors, are not vectors (e.g., like x, v, and v), but simply lists. Similarly, symbols such as 
Vq are used to indicate in a compact and suggestive way the list of partial derivatives. 

For a given Hamiltonian system with H = H(q,p), the Poisson bracket of two regular functions U, V is defined as 


n 


aU av. aU aV 
u.vl=)>> ( : coe ) : (9.15) 
2 \ 8G: OPI OP: OG: 


Notice that the Poisson brackets can be interpreted as the advective term of the material derivative in phase space, where the 
“velocity” is given by the Hamiltonian function. Finally, U and V are said to be in involution if their Poisson bracket 
vanishes, i.e., 


[U,V] =0. (9.16) 
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However, two comments are in order here. The first is that the student should avoid the 
erroneous idea that the CBE can be transformed in curvilinear coordinates only through 
the procedure just described. Second, in many applications the CBE is written by using 
curvilinear coordinates (q) and velocities v (see Exercise 9.1), not conjugate momenta 
p = V@qZ£; therefore, it is natural to ask what happens to Eq. (9.14) because general- 
ized velocities are not (in general) canonical variables, and so Eq. (9.13) in general does 
not hold! In order to answer this important question, we can proceed as follows. First, 
notice that 


MQ) = / fxv;1)dx = i F'a.v;t)|J(q,»)\d2qd? (0.17) 
Q(t) Q'(t) 


is obviously true, but now the Jacobian J is in general different from unity and the trans- 
formed six-dimensional “velocity” field W’ = (q, ») is in general not solenoidal. However, 
AM[2(t)] is conserved, being the same as in Eq. (9.11), and so from transport theorem 


Pr 


/ Odi Ov; 

+ f’|J|divW =0, div W’ = (H+ =). (9.18) 
We can simplify the first expression in Eq. (9.18). In fact, consider again Eq. (9.17) with 
f = f’ = 1: the first integral gives only the phase-space volume of Q(t), and as we have 
seen, this volume is conserved in Cartesian (or canonical) coordinates according to the 
Liouville theorem (see also Appendix A.10.3), so that the last integral is also independent 
of time. The corresponding equation for |J| is then obtained immediately from Eq. (9.18) 
with f’ = 1. If we now expand the material derivative® in Eq. (9.18) and we collect the 
vanishing term associated with volume conservation, we see that in generic curvilinear 
coordinates the CBE reads 


Df’ _ af’ af af’\ _ 
Pe Fay (a ie +5) =o Org 


(e.g., see Ogorodnikov 1965). Restricting ourselves to curvilinear orthogonal coordinates, 
from Eq. (9.41) we finally prove that the CBE can be written as 
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where aj jx = (0f;/0q;,f;). As a useful exercise, the student should repeat the analysis 
and obtain the CBE for f’ = f’(q,q;r) (i.e., for the DF expressed in terms of generalized 
coordinates and velocities). 


6 Notice that D(a b)/Dt = b Da/Dt + a Db/ Dt from the definition of the material derivative in Eq. (A.173). 


9.2 The Collisionless Boltzman Equation 159 


Figure 9.2 Motion of a set 2(0) in the one-particle phase space y, where {2(0) is transformed into 
Q(t) under the action of the smooth potential ¢: no points can leave the set, the volume is conserved, 
and the CBE in Eq. (9.9) follows. However, the true evolution of the system is not determined by the 
smooth potential @ but by the true (granular) potential ¢true in Eq. (7.1), and after some time point 
(P}™®) it can leave/enter Q (f) without violating the unicity of the solution of the differential equation: 
the associated loss and gain of particles is described by the sink/source terms in the collision operator 
in Eqs. (9.21) and (9.22). 


9.2.2 The Fokker—Planck Equation 


We recall again the main differences between the CBE and the Liouville equation. Both 
equations hold rigorously in their domain of application (i.e., the Liouville equation is the 
exact equation describing the motion of a general N-body system in phase space I’ and 
the CBE is the exact equation describing the motion of a perfectly collisionless system in 
phase space y). As anticipated, we now discuss how to modify the CBE for the description 
of systems with finite N over time scales longer than fp (i.e., gravitational systems in the 
collisional regime), while retaining the advantages of working with ¢ instead of true. 
From what we said at the beginning of Section 9.2, the fact that the number of particles 
of the true system can change inside the control volume (2(t) associated with the smooth 
potential # (see Figure 9.2) can be modeled by adding to the right-hand side of the CBE a 
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source (or collision) term. This collision operator is expected to depend on the smooth DF 
itself; in other words, we can write 


=i CIf], (9.21) 
Dt 
where C[ f] is, for the moment, an unspecified function that should depend not only on the 
present value of f, but also on its previous history (i.e., on f over all of the extended phase 
space. The Fokker—Planck equation is obtained from specific (and physically motivated) 
assumptions on the nature of the function C[f]. The student interested in its rigorous 
derivation can find excellent presentations in the literature (e.g., see Binney and Tremaine 
2008; Saslaw 1987; Spitzer 1987). Here, in line with the introductory level of this book, we 
follow a more phenomenological (but no less physically sound) treatment. 
A first simplification is to assume that the collision operator C[ f] at time ¢ is completely 
determined by f at the same time (i.e., the previous evolution history of f is ignored’). 


Then we can write 
= of af 
cuni= ($F) +(Z) (9.22) 


where the subscript = refers to particles that enter and leave the phase-space volume around 
the six-dimensional point w, a generic name for (x,v), (q,p), or (q,v). A probability 
function Y(wy),Ww2;f) is then introduced, so that Y(w 1, w2;1)d°w2 is the scattering rate 
of particles coming from w, in the volume element d 6w> around w; + wo. Therefore, 


(~) = [ vow —wewin fo windew, 
Oty 4 Jy 
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(9.23) 
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Equation (9.21) with the resulting collision term is called the Master equation. 

A second simplification is motivated by the fact that the main effects of the gravitational 
scattering are due to weak encounters (see Chapters 7 and 8; i.e., events that change by 
small amounts the coordinates w). Mathematically, this means that the main contributors 
to the collision terms are from phase-space regions “near” to w (i.e., with “small” w’). By 
expanding up to the second order the first identity of Eq. (9.23) and by adding the second 
identity, we obtain (do it!) the collision term in the Fokker—Planck approximation 


6 6 2 
af (w;t) Di(w) 1 0° f (w,t) Dij (w) 
C = ; 9.24 
if] Be Ow; 7 2 2 Owj0w; ( ) 
=1 ingest 
where the diffusion coefficients are given by 
Dj(w) = / wit (ww; t)d°w, Di; (w) = i ww) WU (w, w; t)dow’, (9.25) 
Y y : 
fori, 7 = 1,...,6. A third and final simplification is discussed in Exercise 9.6. 


7 In statistics, this is called a Markov property. 
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We conclude this brief excursus about the Fokker—Planck equation by recalling that it 
is an invaluable tool for numerical studies of the evolution of stellar systems in a weakly 
collisional regime, but we will not discuss these issues any further. The remaining part of 
this book, starting with the next section, is dedicated to the idealized class of stellar systems 
that are (1) stationary and (2) perfectly collisionless. Even though these two assumptions 
can be considered to be very strong, a significant part of the results in stellar dynamics are 
obtained within this framework. 


9.3 Integrability and the Jeans Theorem 


From a physical point of view, the assumption of stationarity is interesting because the 
majority of stellar systems such as galaxies and globular clusters appear to be described 
by regular morphologies, smooth surface brightness distributions, and relatively simple 
stellar kinematics. All of these properties suggest that the objects are in a stationary (or 
quasi-stationary) state. From a mathematical point of view, stationarity is also interesting 
because a powerful theorem (the Jeans theorem) restricts the possible dependence on the 
phase-space coordinates of the equilibrium phase-space DFs. In particular, we will show 
that for stationary collisionless stellar systems the DF must be expressed in terms of regular 
and global integrals of motion; unfortunately, with the exception of the classical integrals 
related to the obvious symmetries of the potential, their mathematical description can be 
very complicated, and in many cases these additional integrals may not exist at all. 

In order to prepare for the following discussion, we begin by writing the CBE Df/Dt = 
0 for a stationary stellar system; for example, from Eq. (9.9) we see that the mathematical 
formulation of our two assumptions, once expressed in Cartesian coordinates, is 

af 


re =0 VW ie, [f,H] = (v, Vx f) — (Vx br, Vy f) = 0. (9.26) 


Of course, we can (and we will!) also consider the equivalent formulations obtained by 
imposing time independence in Eqs. (9.14) and (9.19). In addition, it should be easy to 
see that once f is independent of time, all of the macroscopic quantities derived from 
f (see Chapter 10), in particular density and potential from Eq. (9.10), are independent 
of time (i.e., dd/dt = O Vr); clearly, in a stationary system with an external potential, 
Idext/It = 0. 


9.3.1 Integrability 


As we have seen in Sections 9.1 and 9.2, there are geometric objects naturally associated 
with an equation such as Eq. (9.26) (i.e., the characteristics) that in the present stationary 
case are the solutions of the general system in Eq. (9.5) specialized to the autonomous (i.e., 
time-independent) case and with the “velocity” field given by Eq. (9.8), as we are using 
Cartesian coordinates for simplicity. However, the following discussion is independent 
of the coordinates adopted; moreover, the student is warned that the remaining part of 
this chapter is necessarily more technical than other parts of this book, and the interested 


162 The Collisionless Boltzmann Equation and the Jeans Theorem 


student is encouraged to consult more advanced treatises (e.g., Arnold 1978; McCauley 
1997; Whittaker 1917; Wintner 1947; see also Ciotti 2000). 

We begin with a concise review of some basic facts about the integrals of motion (some- 
times also called first integrals) for a generic ODE — fundamental geometric objects asso- 
ciated with the characteristics. An integral of motion for a generic autonomus ODE such as 
that in Eq. (A.166), with solution Y = W(x; 7) as in Eq. (A.167), is introduced as follows: 


(1) A function /(x) : A C 8" + NR, so that Vxo € A 

dIg(xo;t) _ 

dt 

is called a integral of motion for the generic field W(x) over A; I is the Lagrangian 

function associated with 7, and for the moment x indicates generic coordinates in ”. 

(2) If J is an integral of motion and J € cv )(A), with r > 1, then / is called a regular 
integral of motion for W over A. 

(3) If 7; fori = 1,...,k are regular integrals of motion over A and the vectors Vx /j 

are linearly independent over A, then the k integrals J; are said to be functionally 

independent over A. 


0, Ie (xo;3t) = IW (x0; 1)], (9.27) 


(4) If 7 is a regular integral of motion for W and satisfies the implicit function theorem 
over A with respect to one variable x; (e.g., Edwards 1994; e., if 1° = 1(xo)), then a 
function h; exists so that 


0 0 
xj = hex, ...,%7-1, X41, --- Xn), 101, ...,Xi-1, hi, Xi41, .-.,4%n) = I. 
(9.28) 


Then / is an isolating integral of motion for W over A with respect to x;. 

(5) In the above definitions, the integrals of motion are global (i.e., they exist over all A); 
when they exist only in a local sense (i.e., when A is a “small” region around xo), they 
are called local integrals of motion. 


The student may ask why we are so concerned with the properties of the integrals of 
motion. The answer is provided by the concept of Jacobi—Lie integrability (e.g., see Eq. 
(1.6)). An autonomous ODE such as that in Eq. (A.166) is said to be integrable in the sense 
of Jacobi-Lie if and only if n — 1 functionally independent, global, isolating integrals exist. 
Otherwise, it is called nonintegrable. It should be clear that the property of “integrability” 
does not refer to our ability to “solve” some difficult integral and/or to define some “new” 
transcendental function in terms of which one can express the solution. On the contrary, 
integrability is an intrinsic property of a system related to properties of W. Let us discuss 
the geometric meaning of Jacobi—Lie integrability. When the assumptions given in Eq. 
(9.28) are verified for the first n — 1 coordinates, one can write for a generic initial condition 
xo € A, and globally over A: 


HERG ust ate 2S Hl, 


7 (9.29) 
Xn 0 70 0 
dt = Wilh, ha, ...,An-1,Xn) = wily. t, see Ti 4+Xn)s 
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where we used the integrals of motion to express® the coordinates i = 1,...,n — 1 in 
terms of x,, and the values ii are fixed by the initial condition x9. The remaining differ- 
ential equation for x, can be integrated (at least in principle!) by separation of variables. 
Geometrically, the solution of the ODE in the case of Jacobi—Lie integrability can there- 
fore be seen as the intersection of global functions; a very important example is given 
in Chapter 4, where the solution of the two-body problem was in fact obtained without 
performing any integration! In other words, the Jacobi—Lie integrability condition is the 
technical translation of the old-fashioned name of integration by quadratures. Note that 
a generic autonomous ODE satisfying the conditions for the existence, uniqueness, and 
regularity of the solutions (see Appendix A.10.1) possesses global, unique, and regular 
solutions, even without being Jacobi—Lie integrable. In other words, a nonintegrable ODE 
(in the Jacobi-Lie sense) is characterized by a vector field that does not possess a sufficient 
number (n — 1) of functionally independent global isolating integrals of motion. 

The requirement of globality is of fundamental importance for integrability, as we now 
illustrate. In fact, let us consider a system (not necessarily integrable) for which a solution 
exists. Then the following flow-box (or rectification) theorem holds (e.g., see McCauley 
1997): let xy € A C MR” be an ordinary point for the vector field W € C)(A) withr > 1 
(1.e., W(xo) 4 0). A neighborhood Ag of xo and a function @ € C r )(Ag) exist so that, with 
the coordinate transformation 


0 _ 
y=G), Untx) = %, (0.30) 
OX; 
Eq. (A.166) transforms into 
31 = Uj (Xo) W(X) = 0, in = Une (Ko) We(x) = 1, (9.31) 
fori = 1,...,n — 1. The solution of Eq. (9.31) is straightforward, i.e., 
yi = yp = Gi (%), Yn =F. (9.32) 


In practice, with the change of coordinates in Eq. (9.30), the solution near the regular 
point xg is transformed into a linear “parallel” flow. Obviously, the specific form of @ 
is determined by W. Note also that the first n — 1 components of (xo) are regular, local 
integrals of motion for W; in other words, the regular /ocal integrals of motion are simply 
the ¢; functions fori = 1,...,n — 1. A natural question arises: Could we use the initial 
conditions as global integrals of motion for any ODE? After all, the n coordinates of the 
initial conditions are conserved for any time along the orbit, not only locally! Unfortunately, 
as we will now prove by using two different lines of reasoning, the answer is negative. 
The basic idea in the geometric argument is that of the time elimination between the n 
components of the general solution by using initial conditions as integrals of motion. In fact, 
when the solution W exists (and is unique), from Eq. (A.168) it follows that xo = W(x; —f). 
The possibility of using initial conditions as global integrals of motion is equivalent to the 


8 Obviously, the functions h; in Eqs. (9.28) and (9.29) are not the same. 
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time elimination for all times between n — 1 components of W. In fact, let us assume that 
for some k we can obtain globally t = Y, ! (x, xe): then, after substitution, we would obtain 


Wilx, — Wy! (x,x?)] = xf (9.33) 


L 

fori = 1,...,, and so the functions above would be a family of global integrals of motion 
for W. The problem here is that there is no reason to expect that the function Y, ! will be 
defined any better than in a local sense. Notice that this remark is true even for systems that 
are actually Jacobi—Lie integrable (see Exercises 9.7 and 9.8)! The general impossibility 
of using the initial conditions as regular, global integrals of motion, even though they are 
conserved along the solution of a generic (well-behaved) ODE, can also be shown in a more 
formal way by means of the flow-box theorem. In fact, using initial conditions as global 
integrals of motion is equivalent to requiring the validity of the rectification in Eq. (9.31) 
globally; in other words, that the transformation Uj, (x) satisfies globally (1.e., for all x) the 
identities 

Uig (xX) Wek) = 0, F=1,...,2-—1; Ung (XW, (x) = 1. (9.34) 
Now, let us consider what the consequences are of the combination of Eqs. (9.30) and 
(9.34). From Eq. (9.30), the following differential 1-forms are associated with the change 
of coordinates 


dy; = Uix(x)dx, = 0. (9.35) 


In order to define a function (and so to be global, regular integrals of motion, because for 
each solution dy; = 0), the functions y; also need to be path independent and the vectors 
fields U;, need to be globally exact, in turn requiring the closure condition 
AUji (x) _ OVim (x) 
OXm Ox] 


(9.36) 


fori = 1,...,n and/ ¢ m, a condition that surely cannot be expected to be satisfied for 
general vector fields W. This is the reason why initial conditions cannot be used more than 
locally (i.e., for small t) as regular integrals of motion. In other words, for nonintegrable 
ODEs, the initial conditions do not define global holonomic coordinates. In contrast, in 
Jacobi—Lie integrable systems, the global vector field generated by the flow-box theorem is 
exact, and so 


:(x) = I(x), i=l,...,2—1. (9.37) 


The interested student is strongly recommended to consult the remarkably clear book of 
McCauley (1997), from which the following passage is quoted verbatim: 


. every non-integrable flow in phase-space has n — 1 time-independent global conservation 
laws,... but they are singular: during a finite time 0 < t < t, the flow can be parallelized via a 
Lie transformation, but at time t, one of the conservation laws has a singularity and so there is no 
parallelization by that particular transformation for t > t,. According to the Flow-Box theorem, 
however, there is nothing special about the time t,: the flow can again be parallelized locally for 
t > ty, and the same argument applies again. This means that the singularities of the conservation 
laws of anon-integrable flow must be like branch cuts or phase singularities: moveable and arbitrary, 
like the international date line in an attempt to impose linear time on circular time. 
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9.3.1.1 Integrability in Hamiltonian Systems 


How do the previous general considerations apply to the special class of fields W in Eq. 
(9.12) derived from Hamiltonian functions? We can legitimately expect that the special 
(symplectic) nature of the differential equations adds structure (and consequences) to the 
integrability of Hamiltonian systems. This is in fact the case, because from Eq. (9.26) it is 
apparent how a time-independent function U is a regular integral of the motion for the time- 
independent Hamiltonian H if and only if [U, H] = 0 (we., if and only if U is in involution 
with the Hamiltonian; see Footnote 5). This makes clear the importance of Poisson brackets 
when dealing with integrals of motion. In particular, let U, V, Z be regular functions defined 
over the 2n-dimensional phase space I and let a, b be constants: it is not difficult to prove 
(do it!) that the Poisson brackets satisfy the following relations (a Lie algebra; e.g., see 
Arnold 1978; Landau and Lifshitz 1969; McCauley 1997): 


U,V] =-IV,U], 


[ 
b = b 

[aU + bV, Z] = a[U, Z] + DIV, Z], coo 

[UV, Z] = U[V, Z] + VIU, Z], 

[U 


V,Z]] +[Z,[U,V]] + [V,[Z,U]] =0 


The last identity is known as the Jacobi identity (see also Exercise 13.25). Notice that from 
the Jacobi identity with U = H, one obtains that if V and Z are two regular integrals of 
motion for H, so it is? [V, Z]. The relevance of the concept of involution for Hamiltonian 
integrability is made clear by the following fundamental result: let H = H(q,p) be a 
Hamiltonian function over I’. It can be proved that (e.g., see again Arnold 1978; Landau 
and Lifshitz 1969) the field W generated by H is Jacobi—Lie integrable (or completely 
canonically integrable) if and only if 


(1) There exist n global isolating integrals of motion /;. 
(2) J; are functionally independent over I. 
(3) They are in involution over I (we., [Jj, /;] = 0 fori, j = 1,...,n). 


Thus, the symplectic nature of the 2n-dimensional Hamiltonian fields W reduces the num- 
ber of independent integrals of motion required for Jacobi—Lie integrability (provided they 
are in involution!) from 27 — | (as would be required for a generic field of 2n dimensions) 
ton. Note that when the Hamilton—Jacobi equation is separable in some coordinate system, 
the constants of separation are global isolating integrals of motion (e.g., see Arnold 1978; 
Goldstein et al. 2000; Landau and Lifshitz 1969), even though not all integrable systems 
are separable (e.g., see Gutzwiller 1990). We also recall that special Hamiltonian systems 
exist with a number of integrals greater than n: these systems are called superintegrable 
(e.g., see Evans 1990). 


9 Unfortunately, the integrals generated with this procedure are in general not “new.” Also notice that the first identity in Eq. 
(9.38) confirms that, for an autonomous system, H itself is an integral of motion for [H, H] = 0. 
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We conclude this section by recalling that in integrable Hamiltonian systems there exist 
a special class of integrals of motion: the actions J (not to be confused with angular 
momentum!). Actions are the natural momenta in integrable Hamiltonian systems, and the 
associated coordinates are the so-called angle variables 8. Among their other remarkable 
properties, the beauty of actions is that in integrable systems the Hamiltonian function can 
be rewritten so as to depend only on actions, with 


6=VjHD), J=—-VoH() =0, (9.39) 


so that the natural motion in integrable Hamiltonian systems is characterized by con- 
stant actions and by the angle coordinates evolving at constant angular velocity (e.g., see 
Arnold 1978; Binney and Tremaine 2008; Goldstein et al. 2000; Landau and Lifshitz 1969; 
Lichtenberg and Lieberman 1992). In a certain sense, this result proves the quite amazing 
fact that, after all, we are only able (when using appropriate coordinates) to solve motion 
completely in cases of uniform “velocity!” 


9.3.2 The Jeans Theorem 


How do the above results apply to stationary collisionless stellar systems? This question 
is answered by the so-called Jeans theorem (e.g., Binney and Tremaine 2008; see also 
Lynden-Bell 1962c). Let f be the DF over y of a stationary collisionless stellar system. 
Then f depends on the phase-space coordinates only through the regular integrals of motion 
of rt, 1.e., 


f=fUh,...) (9.40) 


(see Exercise 9.10). With the aid of this powerful result, in the following chapters we will 
be able to explore some exact models for stationary collisionless stellar systems. 

Unfortunately, even though an enormous body of literature on this topic is available 
and many deep results are known, no general methods able to determine whether a given 
system is completely canonically integrable are presently known. In practical examples, 
the simplest way to find integrals of motion makes use of known (more or less evident) 
symmetries of the system (Noether theorem; e.g., see Landau and Lifshitz 1969). For 
example, let us assume ¢r to be the potential of a gravitating system. Then: 


(1) If the potential is time independent, d¢r/dt = 0, then H = E is a global isolating 
integral (i.e., energy E is conserved along the orbit of each star). 

(2) If the potential is time independent and spherically symmetric, dy = ¢r(r), then H = 
E and J are global isolating integrals (i.e., energy E and angular momentum J are 
conserved along the orbit of each star). 

(3) If the potential is time independent and axisymmetric, dr = ¢r(R,z), then H = E 
and J, are global isolating integrals (i.e., energy E and the axial component J, of the 
angular momentum are conserved along the orbit of each star). 

(4) If the potential is time independent and separable in ellipsoidal coordinates (i.e., of 
the Stackel family; see Section 13.2.3), then H = E and the other two constants of 
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separation of the Hamilton—Jacobi equation are conserved along the orbit of each star 
(e.g., see de Zeeuw 1985a; de Zeeuw and Lynden-Bell 1985; Dejonghe and de Zeeuw 
1988 and references therein). 

(5) More generally, if the potential is completely canonically integrable, then according to 
Eq. (9.39) the three integrals can always be identified with the three actions (J, J2, J3) 
(see chapter 4 in Binney and Tremaine 2008 and references therein). 


Exercises 


9.1 Consider a system of curvilinear orthogonal coordinates q = (q1,q2,q3) (see 
Appendix A.8) and consider the orbit x[q(t)] of a point under the action of the 
potential @. First, show that the components v; of the velocity v = x along the 
unitary curvilinear versors f; and their time derivatives v; can be written using the 
Lamé coefficients h; in Eq. (A.140) as 


3 
: ; lo VjVE of; 
skids =O + YS oi, aie = (SEA, (Al) 
3 J 
J; . 


where we suspended the convention of summing over repeated indices.!° In particu- 
lar, compute the @;;, symbols in spherical and cylindrical coordinates and show that 
in the two coordinate systems (in the usual notation for generalized velocities) 

Ur =f, Vg = rd, Vg =rsinv¢, 


ab , M+ % 


Uy = — sa. ; 

or r 

(9.42) 

ldd vu, Ve Us 
vy = --— cotv—, 
’ r ov r as r 

1 oa 

Ug = - oe cote oe 

rsind dg r r 


while in cylindrical coordinates 


vR=R, ve =RGO, vz, =%, 


dp UG 

ae 

Lae tay (9.43) 
"Roe R 

_ _ 99 

ae 


10 The coefficients aj jx could be expressed in terms of Christoffel symbols (e.g., Narashiman 1993). From the orthonormality 
of the basis f;, it follows immediately that @; ;~ = —ajj;, so that aj, =O fori =k. 
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Hints: By definition v; = (v,f;), so that from Eqs. (A.139)-(A.141) 


3 3 
dx 
age a, igifi, (9.44) 
i= i=1 
and this proves the first identity in Eq. (9.41). The generalized accelerations are then 
obtained starting from the obvious identity v; = (v,f;) + (v,f;) and using 
dv ome 7 
L 
= qj-fx) fk. (9.45) 
dt ae 0qj i 


where in the last espression the derivative of the unitary versor is decomposed in the 
f; basis itself, and finally g; = v;/hj; from the first identity of Eq. (9.41). 

The determinant of the Jacobian J of canonical coordinates transformations evalu- 
ates to 1, being J symplectic (e.g., see Arnold 1978; Binney and Tremaine 2008; 
Lichtenberg and Lieberman 1992). With this exercise, we prove by direct evalua- 
tion that this in fact is true when (q,p) are associated with curvilinear orthogonal 
coordinates. First, from the coordinate transformation x(q), show that for a natural 
Lagrangian 


3 3 


v=) -ufi=>> rf (9.46) 
i=l i=l 


where h;(q) are the Lamé coefficients. Then compute 


d(x, Vv) 
€ ’ 

d(q.P) 
and show that it is the determinant of a block diagonal matrix. From the well-known 


properties of determinants and Eqs. (9.46) and (A.141), then show that |J|=1. 
Repeat the exercise with curvilinear coordinates and velocities v, and show that in 


J(q,p) = (9.47) 


this case 
IJ (q,v)| = Aih2h3 (9.48) 


(i.e., that at fixed q we have d*v = d*v). Finally, show that in action-angle variables 
(J,0), d°xd?v = d?Jd*0 = (2m)3d°J, where the last identity holds for integrable 
systems performing periodic motions. Hint: Consider a natural Lagangian £ = T—U 
(i.e., a Lagrangian with U independent of q) and show that in curvilinear orthogonal 
coordinates 


we 


a hogs = hii. (9.49) 
Ll 


Pi = 


9.3 


9.4 


9.5 


9.6 
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From Eq. (9.20) and Exercise 9.2, show that the CBE in spherical coordinates is 
given by 


af af vo af vy of 
at ‘dr or 80 orsind ag 


ap vwstu2\ a 1/a a 
(* ¥ s) f (<6 + wv» — cots v2) 2 


or r dv, r \av Ovy 
1 1 oO¢ of 
r (a5 dp Se vty) dVg ne 


while in cylindrical coordinates 


af af waf af 
fo aR” Roe be 


ao ve\ af 1 (ad af ad af 
ee ot 9.51 
(3 =) dur R \dg ay dg = 0Z OUz ae 


(e.g., see Binney and Tremaine 2008; Ogorodnikov 1965). 


Consider a spherical system with ¢ = ¢(r) and a phase-space DF f(€) depending 


only onr, v;, and v; = ,/ Ty + Va Show that Eq. (9.50) reduces to 


2 
Dg (3 “) Of eg, (9.52) 


ot "or or r ) Ov, r Ov; ~ 


Consider an axisymmetric system with @ = #(R,z) and a phase-space DF f(€, Jz) 
independent of g. Show that Eq. (9.51) reduces to 


of af af (# *) Of upd, Of db af _ 


of "aR “ae \GR Rf} dup. RK Bip 02 avs 


0. (9.53) 


The diffusion coefficients in Eq. (9.25) can be further simplified by considering the 
so-called local approximation, i.e., 


Ww, wt) = 6(x’) Wv,v5), (9.54) 


in which one assumes that collisions change only the velocities of particles, while 
their positions remain unchanged. Show that Eq. (9.24) becomes 


3 3 


_ af Div) 1 a? f Dij(v) 
C= dX, ce toh cee (9.55) 


(e.g., see Binney and Tremaine 2008; Spitzer 1987). 
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9.7 


9.8 


9.9 


The Collisionless Boltzmann Equation and the Jeans Theorem 
Let us consider the standard two-dimensional harmonic oscillator described by 
¥=—-Mx, jo py, (9.56) 


where A and yz are two positive constants. Prove that the two isolating, global integrals 
of motion are the two energies, i.e., 


2 2,2 “2 2,2 
Xo Ax yoo wy 
LS LS 9.57 
pe 5 a a (9.57) 
Moreover, prove that the energy of the system is conserved, with 
x2 y2 
Be ee ded: (9.58) 


Prove that the general solution of Eq. (9.56), in terms of initial conditions, is given by 


Jz 
xr 


cos(At — 9°), 


x(1) = xocos(At) + sin(At) = 


(9.59) 
yo 21 0 
y(t) = yo cos(ut) + ry sin(ut) = cos(ut — gy), 
where, for the x component, 
0 ax, xo = 0, AX0 
oy = a, = 0 < Arccos <7, (9.60) 
Qn —ay, in <0, V2 


where Arccos is the principal determination of arccos. By inverting the first identity 
of Eq. (9.59), one obtains 


0 x(t) 
At = g, + 27k — Arccos ; (9.61) 
219 
where k = 0, 1, 2,..., and t = 0 for k = 0. The impossibility of a global 
inversion with respect to time for —co < t < oo is apparent. But this is only the first 
inconvenience; in fact, from Eqs. (9.59)-(9.61) 


21° 
y 27 wk Ax (t 
y(t) = ; cs| sl Kanon | at 4 + Po - “| : (9.62) 
in 


Xr 21 r 
Show that for a given x(t), if ~ and A are rationally dependent, y(t) assumes a 
finite number of values (in general more than one), while if jz and A are rationally 
independent, the set of y(t) corresponding to a given x(t) is dense (e.g., see Woltjer 
1967). 
By using Eq. (9.15), prove by explicit computation of the Poisson brackets that for 
the canonical variables in an n-dimensional Hamiltonian system 


[pis Pjl=lai.gj1=9, [ai,pjl=Sij, i.7 =1,...,n. (9.63) 
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Moreover, prove the well-known relations obeyed by angular momentum J in Carte- 
sian coordinates 


[Ji Ji] = eijeJe, (vi, (ISI? = 0 (9.64) 


(e.g., see Goldstein et al. 2000). 

9.10 Prove the Jeans theorem. Hints: If f is a time-independent solution of the CBE 
(9.26), then from Eq. (9.27) f is an integral of motion. Conversely, assume that 
f=fH,h,..., 1%) as in Eq. (9.40) and conclude 


a (9.65) 
i=1 


because DJ; /Dt = 0 fori = 1,2,...,k. 


10 


The Jeans Equations and the Tensor Virial Theorem 


In this chapter, we show how the (infinite) set of equations known as the Jeans equations are 
derived by considering velocity moments of the collisionless Boltzmann equation (CBE) 
discussed in Chapter 9. The Jeans equations are very important for physically intuitive mod- 
eling of stellar systems, and they are some of the most useful tools in stellar dynamics. In 
fact, while the natural domain of existence of the solution of the CBE is the six-dimensional 
phase space, the Jeans equations are defined over three-dimensional configuration space, 
allowing us to achieve more intuitive modeling of directly observable quantities. The phys- 
ical meaning of the quantities entering the Jeans equations is also illustrated by comparison 
with the formally analogous equations of fluid dynamics. Finally, by taking the spatial 
moments of the Jeans equations over configuration space, the virial theorem in tensorial 
form is derived, complementing the more elementary discussion in Chapter 6. 


10.1 The Method of Moments 


As we have seen, the general solution of the CBE depends on knowledge of the proper- 
ties of orbits in three-dimensional (possibly time-dependent!) potentials, a problem that is 
well beyond the present possibilities of mathematics (as openly admitted in Arnold 1978). 
Therefore, in addition to the use of numerical simulations, various techniques have been 
developed in order to “extract” information from the CBE. These methods can be broadly 
divided into (1) the method of moments and (2) the construction of particular solutions for 
stationary systems from the Jeans theorem; as we will see, in the common case of stationary 
systems, the two approaches can be used together to maximum profit. 

The basic idea behind the method of moments, the subject of the present chapter, is to 
look for differential equations that are simpler than the CBE and describing the relations 
between particular functions defined as moments of the distribution function (DF) over 
velocity space (the Jeans equations) and over configuration space (the tensor and scalar 
virial theorems). All of the results presented in this chapter are derived in an inertial 
reference system So, and the convention of summing over repeated indices is used. 
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The starting point in deriving the Jeans equations for a system with a phase-space DF f 
is to consider a suitably choosen microscopic function defined over the phase space y, 
which we indicate generically with F = F(x,v;f). Intuitively, F is the mathematical 
expression of any physical property of interest that “knows” what a star at position x and 
with velocity v is doing at time rt; for example, it could be the position or the velocity of the 
stars, their kinetic energy or angular momentum, and so on. The associated macroscopic 
function F is then naturally defined as 


F(x;t) = F(x, v;t) f (x, V; t)d>v, p(x;t) = SRY; t)d>v, (10.1) 
aia 


p(x; t) Jx3 
where (x; f) is the material density of our stellar system at position x and at time ¢. From 
now on, a bar over a symbol will represent the operator given in Eq. (10.1); of course, if 
F is independent of v, then F = F. From a physical point of view, F can be interpreted 
as the average value (at time f) of F over all of the velocities of the stars that (at time f) 
contribute to the density p at x. 

In stellar dynamics, among all of the functions F that can be imagined, especially 
important are those leading to the so-called velocity moments. For example, up to the second 


order, 
Ui(x:t) = v; fav, (10.2) 
P(t) Jo3 
viv; (x;t) = jv; fay, 10.3 
0; 0; (x; t) ees vjuj fav (10.3) 
o2.(xit) = (vj; — (vj — 0) fd’v = 03, (10.4) 
" p(x3t) Ja3 a a 
where i, 7 = 1,2,3, v;(x;t) are the components of the streaming velocity field, and the 


(symmetric) tensor of (x;f) is the velocity dispersion tensor. Obviously, higher-order 
velocity moments are defined by considering higher-order products between the velocity 
components (e.g., see Exercise 10.6). Note that 


Of, = U0] — GV; (10.5) 


so that of can be negative fori # j, while the diagonal terms are necessarily positive. 
The reason for adopting the formal notation with a square in Eq. (10.4) is as follows: given 
the symmetry of oF under the exchange of i and j, an orthogonal rotation matrix R(x; f) 
exists so that at each point of the system the velocity dispersion tensor is diagonal in the 
new reference system, with only positive diagonal entries. In summary, of is a positive 


definite symmetric tensor. This leads to a simple geometric interpretation of oF: at each 
point x of a stellar system, it is possible to associate a velocity dispersion ellipsoid whose 


“surface” is defined as a function of the unitary versor n = n;e; by 


o7(x,n) = oninj, (10.6) 
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Figure 10.1 Schematic representation of the first three velocity moments of the phase-space DF f 
of a stellar system (1.e., the scalar density field, the streaming velocity vector field, and the velocity 
dispersion tensor). At Point 1 the system is isotropic, and at Point 2 some anisotropy is present. 


where n spans the whole solid angle around x. In general, the orientation and the axial 
ratios of the velocity dispersion ellipsoid change from place to place and with time inside a 
system (see Figure 10.1). A highly idealized but very important class of systems is that of 
isotropic systems — systems for which, at each point in space 


OF; (xt) = 07(x;1)6;;. (10.7) 


In this case, the velocity dispersion ellipsoid is everywhere a sphere. Of course, a stellar 
system can be isotropic just at some position, or in a limited region of space, or nowhere; in 
all of these cases we say that the system is anisotropic. As we will see, velocity anisotropy 
is a key concept in stellar dynamics. 


10.2 The Jeans Equations 


We can now move to derive the Jeans equations associated with a DF f and a total potential 
or. We first prove that the general differential equation (for the moment in Cartesian 
coordinates for simplicity) obeyed by velocity moments of F can be written as 


pF  dpF vu; ddr OF OF OF 
= alae ee 10.8 
at Ox; Pan Oe ae Oe ae 


In Eq. (10.8), dr = & + dext is the total potential, with 


ah if 
(xt) = —G / PO Be ie 6 i Pest) 89, (10.9) 
98 Ix—yll 9 Ik—yl 
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For example, o could be the stellar density of a galaxy component and /ext the density 
distribution of a dark matter halo or a massive gaseous component not described by the DF 
of interest. dex, could also contain a term describing the effect of a supermassive central 
black hole (BH). 

Equation (10.8) is easily obtained from the identity 


Lz — Fdv=0, (10.10) 
3 D 


a direct consequence of the CBE. In fact, from Eq. (9.9), the first term in Eq. (10.10) can 


be rewritten as 

r) Of F doF OF 

ray = [ “ef Gy a ee, (10.11) 
73 ot ie) ot ot ot 


because the time derivative and integration over v can be interchanged, being the coordi- 
nates v independent of t. The second term is 


r) OfF oF dp Fu; oF 
/ uctray= [ u( es )av= oe _ pu, (10.12) 
ns n3 


Xj OX; Ox; Ox; ' Ox; 


because the spatial derivative and integration over v can be interchanged, being v indepen- 
dent of x. Finally, the third term in Eq. (10.10) is 


ddr 0 ofF OF ddr OF 
i (oa =f ( i f )av= Sop oe (10.13) 
3 OX; Ov; OX; 73 Ov; Ov; OX; Ov; 


because ¢r is independent of v and we assume fF — 0 for ||v|| — oo (and so the 
integration over all of the velocity space of the exact differential with respect to v; evaluates 
to 0). Equation (10.8) is now proved by adding together Eqs. (10.11)—(10.13). 

The most common Jeans equations are now derived in Cartesian coordinates with some 
elementary algebra by choosing F = | and F = v; in Eq. (10.8): 


dp  dpvu; — Dp Ov; 
= ~0, 10.14 
a Ou DE om aon) 


APY; IPVIV] dgr : 
= ; = 12:3 10.15 
ot Ox; 2 OX; u ) ( ) 
(e.g., see Bertin 2014; Binney and Tremaine 2008; Chandrasekhar 1942; Ciotti 2000; 


Ogorodnikov 1965; Saslaw 1987). Note that with some work Eq. (10.15) can also be 
1 


written as 


Di; OV; tl _ o¢r 1 099; 
Di Bt "Ox; ~~ a; Ox; 


» (=1,2;3). (10.16) 


! Th the rewriting of the Jeans equations (10.14)-(10.16) we used the material derivative defined in Eq. (A.173), which in the 
present case reads D/Dt = 0/dt + ¥70/0x;. 
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Naturally, the Jeans equations can be extended to higher orders (and sometimes are; e.g., 
see Amendt and Cuddeford 1994; Magorrian and Binney 1994 and references therein) by 
considering F = vjvj;, F = vj v; ug, ete. 

Finally, a technical (but important) aspect concerning the Jeans equations is how they 
can be rewritten in curvilinear coordinates such as spherical, cylindrical, or ellipsoidal 
(often used in applications; e.g., see Chapter 13). A full discussion of the transformation 
of differential operators in the general context of continuum mechanics can be found in 
Narashiman (1993), and the student following this path will appreciate how the direct trans- 
formation of partial derivatives is not a trivial task. In particular, the student will appreciate 
the power of different approaches based on the use of the Hamiltonian formulation and 
canonical coordinates to express the CBE, or on the use of Reynold’s transport theorem in 
curvilinear orthogonal coordinates (see Chapter 9 and Exercise 10.2). 


10.3 Analogies with and Differences from Fluid Dynamics 


The student with some knowledge of fluid dynamics will certainly notice the striking 
similarities between the Jeans equations and the fluid dynamical equations of continuity 
and momentum in the presence of a gravitational field and viscosity and in absence of 
mass, momentum, and energy sources and sinks. The analogy is not by chance. In fact, 
the equations of fluid dynamics can also be derived by using Reynold’s transport theorem 
(Appendix A.10) with the fluid velocity u(x;t) = u;(x;f)e; as the basic vector field.? In 
the following, we also consider source/sink terms for later use in this chapter. 

We begin with the continuity equation for a fluid with density o(x;t) and mass source/ 
sink (per unit volume and time) term M(x; ft), obtained by imposing the mass conservation 
for arbitrary volumes Q(t) C S transported by the velocity field; in other words, by 
requiring that 


d 

= pax = Md>x, V Q(t). (10.17) 

dt Jaw) QU) 
The time derivative is carried out under the integration with the aid of the transport theorem, 
the integral at the right-hand side is carried out at the left-hand side, and from the require- 
ment that the resulting integral vanishes for arbitrary volumes, it follows immediately that 


Dp du; Op  dpuU; 
= = MM. 10.18 
Dt OX; ot " OX; ue ( ) 


Therefore, Eq. (10.18) shows that a fluid without sources/sinks is incompressible if and 
only if div u = 0. Incidentally, we notice that in fluid dynamics quite often one encounters 
the generalization of Eq. (10.17) where, instead of p, the product of the density and of some 


2 For remarkably clear discussions of the equations of fluid dynamics, see, for example, Aris (1989), Clarke and Carswell 
(2007), Currie (1993), Meyer (1982), Milne-Thomson (1996), Narashiman (1993), Pringle and King (2007), Shu (1992), and 
Tassoul (1978). For more physically oriented presentations, see the magnificient books of Batchelor (1967), Lamb (1945), and 
Landau and Lifshitz (1986). 
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property (per unit mass) of the fluid (temperature, velocity, internal energy, entropy, etc.) 
appears, and in Exercise 10.3 we derive a very useful lemma concerning these cases that is 
of frequent use. 

We now consider the i-th component of the momentum equation (also known as the 
Euler equation) that is obtained from the Second Law of Thermodynamics 


@ pu; d°x = -[ » BOT By +f pijnj x + Pi d*x, (10.19) 
dt Jaa) Q(t) PO anit) Q(t) 

where the three terms at the right-hand side describe the effect of volume and surface 
forces over the fluid volume Q (rf), respectively, and P(x; t) = P; (x; f)e; is the momentum 
source/sink (per unit volume and time). In particular, in the surface integral n = n;e; is the 


outward unit vector normal to the surface 0Q(t), and the function 
Pij (®t) = — pw 1)dij + Tj (10.20) 


is the stress tensor, where p(x;t) is the thermodynamical pressure and 1;; is the shear 
tensor.» According to Cauchy’s second law (e.g., see Truesdell 1991), pij 1S a symmetric 
tensor: note that from Eq. (10.21), in the hydrostatic case (i.e., u = 0), the stress tensor 
reduces to the thermodynamical pressure only, independently of the viscosity. Using the 
divergence theorem, the surface integral in Eq. (10.19) can be transformed into a volume 


integral 
Opi a 
7 pin [ Pi By = / P ax (10.22) 
XO Q(r) OX; Q(t) Oxi 


(where the last identity holds for inviscid fluids). Finally, from the lemma in Eq. (10.69) 
applied to the left-hand side of Eq. (10.19), one obtains for i = 1,2,3 


Du; Ou; Ou; OgT 4 1 Opij 4 Pi — Mu; 


Dt at aa? ax; |p Ax; p 


(10.23) 


Of course, one could write an equation for the angular momentum of the fluid analogous 
to Eq. (10.19) by starting from the appropriate equation of dynamics, and this is left as a 
(very important) exercise for the interested student (see also Exercise 10.5). 

We now momentarily stop to appreciate the striking similarities between the Jeans equa- 
tions (10.14)—(10.16) and the fluid dynamical equations (10.18)—(10.23). Note how the two 
sets of equations formally coincide in the absence of sources/sinks, provided we identify the 
streaming velocity stars v; with the fluid velocity u; and the velocity dispersion tensor of 
the stellar system po;; with the fluid stress tensor — p;;. Regarding the last point, note how 
from Eq. (10.21) the Roteigic velocity dispersion tensor in Eq. (10.7) would correspond 


3 For the so-called Newtonian fluids, 


du; Ouj duK 
ij + tA Sij, 10.21 
Tj =e ( dx; ax; ax, ij ( ) 


where jz and A are the first and second viscosity coefficients, respectively (e.g., see Truesdell 1984); in incompressible fluids, 
the shear tensor is independent of 1. 
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(prove it!) to the identification po* = p — (A + 21/3)divu; from a monoatomic perfect 
gas, Stokes’ hypothesis assumes 2 + 2;4/3 = O, and so an isotropic velocity dispersion 
would be formally identified with “thermodynamic pressure.” 

An obvious question remains open: Can the similarities exhibited by the continuity and 
momentum equations of stellar dynamics and fluid dynamics be extended to the corre- 
sponding energy equations? We answer this question by deriving first the energy equation 
for a fluid with heat conduction and energy sinks/sources (e.g., radiative loss/heating). The 
continuum formulation of the First Law of Thermodynamics reads 


d 2 F) 
— (z + ptt) ax = - | puj or x+ f Uj PijNj d’x 
dt Jaw) 2 Q(t) OX; IQ(t) . 


-f[ hinj ax+ [ =Da0% (10.24) 
dQ(t) Q(t) 


where E is the internal energy per unit volume, and on the right-hand side the work per 
unit time associated with the volume and surface forces is considered. The heat conduction 
is described by the heat flux vector h(x), and the minus sign in front of the surface integral 
is associated with the fact that n is directed outward from the control volume (2 (f). Finally, 
E(x) and £(x) are, respectively, the energy sources and sinks per unit volume and time. 
Repeating the treatment used for the continuity and momentum equations, and after a 
simplification taking into account Eq. (10.23), we finally obtain (prove it!) 

[lu |l 


DE Ou; JE OEu; Ou; oh; 
= = pij wp +E —L. (0.25 
be tn” Oe ae a, eer 


Note how volume forces do not affect (directly) the internal energy of a fluid, an aspect 
that is not always appreciated; we also recall that by using the same approach it would be 
possible to express the Second Law of Thermodynamics in a continuum formulation, even 
though we will not pursue the subject further (e.g., see Currie 1993; Meyer 1982). 

We now derive the energy Jeans equation for collisionless systems and we compare it 
with Eq. (10.25). For a system made of single stars, the quantity equivalent to the internal 
plus kinetic energy appearing on the left-hand side of Eq. (10.24) is given by the phase- 
space average of the microscopic function F = ||v||*/2, and in Exercise 10.6 three impor- 
tant properties of F are proved; in particular, Eq. (10.72) suggests that, for a stellar system, 
we should identify E = pTr(o;,) /2. Moreover, from Eq. (10.8), after some algebra, and 
taking into accout the symmetry of oF, and p;j;, one obtains 

dE dEvj DE OU; 4 Oj OPXi 


= E— = —po?2 10.26 
i Oy Dk yy Oe cee 


where the analogy with Eq. (10.25) is complete in the absence of source/sink terms and if 
one identifies the components of the heat flux vector h = h;e; with the third-order velocity 
moments (x; in Eq. (10.73). 
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Despite this striking formal analogy, a fundamental difference between fluids and stellar 
systems is apparent. In fact, while the equations of fluid dynamics are a closed set of 
equations because the pressure is related to p and E via the equation of state, the concept 
of the equation of state+ cannot be introduced from basic physical arguments in the Jeans 
equations. This means that the latter are an infinite set of equations. Thus, in order to solve 
the Jeans equations, we are forced to assume a (more or less physically motivated) closure 
relation. In applications, this is often done by specifying the properties of the velocity 
dispersion tensor, and due to its relevance for the modeling of stellar systems, the problem 
will be extensively discussed in Chapters 13 and 14. Moreover, in Exercise 10.2 we derive 
the generalization of Eq. (10.8) for curvilinear (orthogonal) coordinates, while in the next 
section we illustrate an astrophysical problem where both the Jeans equations and the 
equations of fluid dynamics play a fundamental role. 


10.4 Stellar Dynamics and Gas Dynamics in Stellar Systems 


A very broad class of astrophysical problems concerns the interesting situations where 
both stellar dynamics and fluid dynamics are simultaneously involved, such as the physics 
of galactic gas flows, of gaseous cold disks and hot coronae around disk galaxies, the hot 
intra-cluster medium in clusters of galaxies, the Bondi accretion on supermassive BHs at 
the center of elliptical galaxies, and so on. The link between stellar dynamics and fluid 
dynamics is not suprising: after all, in a stellar system, stars and gases share the same 
gravitational potential, as is apparent from Eqs. (10.16) and (10.23). The literature on 
these subjects is immense, and we cannot even attempt to give a partial covering of the 
most important references. Instead, we simply present a few representative cases aimed at 
introducing the student to such a wonderful branch of astrophysics. 


4 For example, a perfect gas is defined as any substance for which the equation of state 


_ kppT 
(u)mp 


= erg _94 
, ky =1.3810716 = Mp = 1.6710 24 9 (10.27) 


holds, where (jz) is the mean molecular weight, kg is Boltzmann’s constant, and mp is the proton mass. The student is 
recommended to distinguish between the concepts of the equation of state and of thermodynamical transformation. Some of 
the most important are the barotropic transformations and their subset of polytropic transformations of index y (the polytropic 
index), given respectively by 

p=po, p= op”, (10.28) 

0 

where po and po are normalization values for density and pressure; nonbarotropic fluids are called baroclinic. A very 
important subset of polytropics is represented by the reversible adiabatics, where y = Yaq = Cp/Cy is the ratio of the molar 
specific heats at constant pressure and temperature. As is well known (e.g., see eq. 25 in Chandrasekhar 1939), polytropic 
transformations are characterized by the constant molar specific heat 


Yad — Y 
l= 


Cooly =Cy ; (10.29) 


a negative number for | < y < yaq (see also Korol et al. 2016). 
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10.4.1 Hydrostatic Equilibria 


An important class of astrophysical problems where the interplay between fluid dynamics 
and stellar dynamics is particularly relevant is that of hydrostatic equilibria (i.e., the study 
of the admissible configurations of gases at equilibrium in the total potential well of stellar 
systems such galaxies or clusters of galaxies). This area is motivated by the hope of gaining 
information on the gravitational potential @r by observation of the gas properties. In fact, 
from Eq. (10.23), in the hydrostatic case, and in the absence of sources/sinks, 


Vp = —pV¢or, (10.30) 


where p is the gas pressure and p is the gas density. Following Poincaré (e.g., Tassoul 
1978), some general properties of the solutions of Eq. (10.30) can be easily derived. First, 
as Vp is parallel to V@r, it follows immediately that the gas pressure is stratified on 
isopotential surfaces (i.e., p = p(¢r)). Second, by taking the rot of Eq. (10.30), we deduce 
that Vp A Vér = 0 (i.e., isodensity surfaces coincide with isopotential surfaces), and so 
also p = p(¢r). Therefore, by elimination of the potential, we conclude that hydrostatic 
equilibria are necessarily barotropic (i.e., p = p(p)) independently of the specific equation 
of state of the gas (see Footnote 4). Finally, if our gas is perfect, then from Eq. (10.27) we 
obtain that T/(j) is also constant over isopotential surfaces (see also Appendix A.6). It 
is not a surprise then that from observations of the gas properties in systems at fiducial 
equilibrium, we can gain information on the total gravitational potential of the system 
(e.g., see Fabricant et al. 1980). 

The previous considerations show that for an assigned potential, an assigned equation of 
state, and an assigned stratification p(e), the integration of Eq. (10.30) can be easily per- 
formed (e.g., Ciotti and Pellegrini 2008). One of the most simple and important examples 
is represented by hydrostatic polytropic equilibria. In fact, it is a simple exercise (do it!) to 
show that by inserting the second identity of Eq. (10.28) into Eq. (10.30), one gets 


1 
—1\yr7 ae 
. ¢ ) OT = Eq)7=1 A(Wr — Eo), 


Po Boy 
T -—1 
= 1 (Wy — £9) (Wr — £0), (10.31) 
To Boy 
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(LL) Mp —1 
where YW; = —d¢y and the subscript 0 indicates the values of the various physical quantities 


at some prescribed position xo in the system. Notice that the solutions above for sufficiently 
low values of Jo can be spatially truncated on the surface Yy(x;) = € depending on the 
value of y and of the behavior of ¢r. A particularly important class of solutions is obtained 
in the isothermal limit y — 1, when €) — —oo, and the student is encouraged to prove 
that the solutions in Eq. (10.31) become untruncated, with 


vr—Vo 


p=poe % , T = To. (10.32) 


We will return to these solutions in Chapter 12. 
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Another significant class of fluid equilibria in gravitational fields of obvious importance 
when considering disk galaxies is represented by axisymmetric gaseous configurations in 
permanent rotation (i.e., their velocity field is written in cylindrical coordinates as u = 
ug(R, z)eg). In this case, Eq. (10.23) reduces to (prove it!) 

2 
Uo 
VE=—pvOrT hem (10.33) 


and again an important theorem from Poincaré states that the (rotating) equilibrium is 
barotropic if and only if the rotational velocity is constant on cylinders (i.e., ug is inde- 
pendent of z), and so an effective potential can be immediately introduced into Eq. (10.33) 
(see Tassoul 1978). Of course, if the density distribution and the gravitational potential are 
assigned, then the gas pressure is fixed by the vertical equilibrium condition and there is 
no reason to expect cylindrical rotation, so that the gas distribution is baroclinic, with the 
pressure not being constant over isodense surfaces. It is easy to prove (do it!) that for gas 
distributions that are sufficiently regular at infinity (the usual case) 


ur © / a0 a ap a 
=f Or 2 EOE) ae (10.34) 
R z OR dz’ = az’ OR 


Equation (10.34) can be used to study the effect of the shape of the gravitational potential 
of disk galaxies on the rotational field of their hot and rotating gaseous coronae, which 
present a clear dependence of their rotational velocity on z (e.g., see Barnabé et al. 2006 
and references therein). We will return to a consideration of Eq. (10.34) in Chapter 13. 

Another important combined use of the Jeans equations and fluid dynamics is obtained 
by elimination of the total potential from Eqs. (10.16) and (10.23) so that for the gas and 
stellar distributions necessarily 


Du; 1 ap — Di 1 Apso; 
Dt p OX; ~ Dt Px Ox; 


, (10.35) 


an identity that, among other applications, can be used to probe the assumption of hydro- 
static equilibrium, to gain information on anisotropy, and finally to test how departures 
from the assumptions can affect mass estimates of the stellar systems under consideration 
in a systematic way (e.g., see Ciotti and Pellegrini 2004; Mathews and Brighenti 2003b; 
Pellegrini and Ciotti 2006). 


10.4.2 Galactic Gas Flows 


An important astrophysical problem where both stellar dynamics and fluid dynamics are at 
play is that of the structure and evolution of the hot gaseous coronae observed in elliptical 
galaxies and in clusters of galaxies. The literature on the subject is extremely vast, and 
attempting of providing even of a very partial list of the results in the field devoted to the 
subject would be foolish at best: Kim and Pellegrini (2012) and Mathews and Brighenti 
(2003a) and the references therein represent very good and general overviews of the prob- 
lem and of the literature (see also Sarazin 1988 for the related problem in clusters of galax- 
ies). Here, we necessarily limit ourselves to one particular aspect: we show how the Jeans 
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equations are needed to specify the source terms in Eqs. (10.18), (10.23), and (10.25) 
describing gas flows. In fact, the important information here is that the stars in a galaxy 
inject gas as they evolve (on the main sequence, as red giants, on the asymptotic giant 
branch, exploding as supernovae, etc.) and that the momentum and energy of the stellar 
ejecta substantially affect the structure and evolution of the resulting X-ray-emitting coro- 
nae, playing a central role in the cooling flow model and all of the associated variants of 
galactic winds, with or without active galactic nuclei feedback from accretion on the central 
BH (e.g., see Ciotti and Ostriker 2012; Ostriker and Ciotti 2005 and references therein; see 
also Ciotti and Pellegrini 2017, 2018). 

Motivated by the previous discussion, let us now focus for simplicity on a simple stellar 
population in a galaxy with internal dynamics described by a phase-space DF f (x, v; r); 
the generalization to the case of more than one stellar component is straightforward in the 
context of extended DFs. For the present purposes, it is useful to adopt the interpretation 
of the DF in terms of the number of stars per unit volume in phase space (i.e., we assume 
that n(x;t) = Sra fd 2 v). As each star will spend different amounts of time in each of the 
different evolutionary phases in the following, we only concentrate on some unspecified 
evolutionary phase of stars of a well-defined mass; as far as the whole stellar population 
is described by the same phase-space distribution, the dynamics of each population are the 
same and the contributions to mass, momentum, and energy injections during the different 
phases can simply be added to determine the total source terms. In the more general case, 
the extended DF formalism should be used. 

In all generality, let m = m(x,v,n;f), the mass return per unit time and unit solid angle 
along the direction n associated with each star, where n = 77;e; is a unitary vector measured 
from the center of the star. In principle, the dependence of m on n would account for the 
possibility of anisotropic mass losses, while the dependence on v could describe some 
dependence of the stellar population properties on phase space (e.g., a counter-rotating 
disk of stars with different metallicity). The total mass return per unit time, volume, and 
solid angle associated with the stars at x is indicated as 4(x,n;f) = Sos m f d?v, so that for 
mass sources independent of the source velocity v (the obvious situation), (x,n; ft) = nm. 
Integration over the solid angle finally gives the total mass return rate per unit time and 
volume at x 


M(x; t) = wd-n = 40m, (10.36) 
An 


where the last expression holds for isotropic mass losses. 

The momentum return associated with each star is given by the vectorial function p = 
m(x, V,n; t)[V+ u(x, V,n;7)n], where us = ||us|| is the modulus of the velocity of the ejecta 
along n measured with respect to the star’s velocity v. The total momentum injected per 
unit time, volume, and solid angle from the considered stellar population at x is then given 
by a(x,n;f) = to p f d>v, so that for mass losses and ejection velocities independent of 
the specific stellar velocity v, we have m(x,n;t) = nm (V + usn), where V is the streaming 
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velocity field at position x in the galaxy, as is given by Eq. (10.2). Integration over the solid 
angle finally gives the total momentum injection rate per unit time and volume at x 


P(x:t)= i xd’n= My, (10.37) 
An 


where again the last expression holds if m and us are also isotropic. In practice, the amount 
of injected momentum in the case of isotropic mass losses is simply proportional to the 
local streaming velocity of the stellar population. 

The energy source associated with the stellar mass losses is made from the contributions 
of two distinct parts: the internal energy source and the kinetic energy source. The internal 
energy source (energy per unit time per unit solid angle per unit mass) is described by the 
function e = e(x,v,n;1f). The total internal energy injected at x in the galaxy per unit time, 
volume, and solid angle is then given by €(x,n;f) = i me f d>v, so that for mass and 
internal energy sources independent of the stellar velocity v, €(x,n;t) = je. Integration 
over the solid angle finally gives the total inernal energy injection rate per unit time and 
volume at x 


Eint(X; 1) =i ed’n= Me, (10.38) 
An 


where the last expression holds if m and ug are isotropic. The kinetic energy injection due 
to a given star is given by k = 5m (x, v,n;f)||V + us (x, V, n; t)n||7, so that the total kinetic 
energy per unit time, volume, and solid angle associated with the stars at x is k(x,n;t) = 
Soya k f dv. In particular, for mass losses and ejection velocities independent of the stellar 
velocity v, Eq. (10.72) proves that «(x,n;f) = nm (||¥||? + Tro? + u2 + 2us(n,¥))/2 
where Tro? is the trace of the velocity dispersion tensor in Eq. (10.4) associated with the 
stellar population. Finally, integrating over the whole solid angle, 
Iv? 2 mer) 
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where the last expression holds if m and us are isotropic because the angular average of n 
appearing in the expression of « vanishes. 

In summary, in a stellar system where the aging stars inject (isotropically) mass, momen- 
tum, and energy, from Eqs. (10.23)—(10.25), restricting to the nonviscous case, and finally 
using the expressions just derived for the source terms, the momentum and energy equations 
for the interstellar medium are (prove it!) 


Du; ddr ldap M 
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(10.39) 


Ui), (10.40) 


jis (10.41) 


DE Ou; uw ‘Tro? |jv—ull? ah; 
tip = 5 
pe 1 Et Pay, (+34 2 13 Ox; 


where the dependence of the source heating term from the kinematical fields ? and ¥, 
determined by stellar dynamics, is apparent. In fact, we can illustrate the impact of stellar 
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Figure 10.2 Schematic representation of stationary gas flow in the presence of stellar source terms, 
as described in Eq. (10.43). The dotted line represents a stagnation surface defined by u = 0 and the 
gas is outflowing at dQout at the constant rate Mout and inflowing at 0Qjn at the constant rate Min. 


dynamics on the energetics of the flows by considering the special and very idealized case 
of a time-independent, nonconductive gas flow in a galaxy and integrating Eq. (10.75) over 
a time-independent volume (2 bounded by two isopotential surfaces OQin and OQout as in 
Figure 10.2. Moreover, we also assume that a stagnation surface (i.e., a surface over which 
u = 0) OQstagn exists, so that the gas must be on average spatially outflowing over dQout 
and on average spatially inflowing toward the center over 0Q2in in order to ensure time 
independence.> Under these assumptions, volume integration over the two regions Qout 
and Qin finally shows that (prove it!) 


w . VP . ‘Tro? 
Lat : a 
[ x [om(e+ 5) + 5) + 5 ) x 


2 
= i (wi G +p m ) wn) ax — Lin + Lit (10.43) 


where 


L eray =f M (Pout — $)d°x, ba [. M ($ — ¢in)d?x, (10.44) 


5 From the continuity equation and the definition of a stagnation surface 
Mout = i; Max = I p(u,n)d2x, (10.42) 
Q, IQout 


and a similar indentity holds for the inflow region Qj, (see Figure 10.2). 
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so that it should be clear by now how the internal dynamics of stars and the gravitational 
potential of the galaxy deeply affect the energy balance of the gas, determining its evolution 
(e.g., see Ciotti and Pellegrini 1996; Ciotti et al. 1991; Negri et al. 2014a,b; Posacki et al. 
2013 and references therein). As a nice exercise, the student is invited to derive the balance 
equation for the case of a global outflow and a global inflow. 


10.5 The Tensor Virial Theorem 


The Jeans equations are invaluable tools, especially when modeling stellar systems. They 
also contain important information that is not immediately accessibile without some addi- 
tional work. In fact, after averaging the CBE over the velocity space, a further (and equally 
fundamental) step can be taken: as is well known, it is possible to obtain exact relations 
(actually, an infinite number of tensorial identities) from integration over the configuration 
space of the Jeans equations after multiplication with suitable functions of the spatial coor- 
dinates. The resulting set of identities is the tensor generalization of the Lagrange—Jacobi 
identity in Eq. (6.10); note that we already obtained an analogous result in the context of 
the N-body problem in Eq. (6.47). Moreover, by reducing to equilibrium configurations we 
will obtain the tensorial virial theorem (TVT) a generalization of the scalar virial theorem 
(SVT), derived in Chapter 6 (e.g., see Binney and Tremaine 2008; Chandrasekhar 1969; 
Ciotti 2000). In the following, we will limit ourselves to the second-order virial theorem, 
which is by far the most used in applications. All quantities appearing in the discussion can 
be time dependent in general, but for simplicity time dependence will not be indicated if it 
is not strictly required in order to avoid possible ambiguities. 

First, for a stellar system described with the Jeans equation and with a density distribu- 


tion (x;f), we introduce the symmetric second-order mass tensor® 


n= f pxixjd°x, I = Tr(iiy). (10.45) 
n3 , , 


Second, three other manifestly symmetric second-order tensors measure the kinetic 
energy of the system, namely 


l 1 
Kij = 5 | ome T;j = 5 | emaas, Tij = | eoja's, (10.46) 


and they are usually named as the fofal, streaming, and velocity dispersion kinetic energy 
tensors. Their trace is indicated by 


K=Tr(Kij), T =Tr(Tjj), OM = Trdj;), (10.47) 
and so from Eq. (10.5) 


(10.48) 


6 Other second-order mass tensors that are important in stellar dynamics and that can be expressed as linear functions of Jj ; 
have been encountered in Eq. (2.89). 
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Third, from the general results on the N-body problem illustrated in Chapter 6, we 
know that in the Lagrange—Jacobi identity and in the SVT, terms related to gravitational 
energy appear, and so we now look for a tensor generalization of this term. We recall that 
two scalar functions describe the gravitational energy Uy of the density distribution p in 
the total potential ¢r = @ + dext, where ¢ is the gravitational potential produced by the 
component p and dex; is the gravitational potential produced by external densities (ext (€.g., 
a central BH, a dark matter halo, a massive gaseous component, and so on). In particular, 
for a system for which we can set dr = 0 at infinity, the gravitational self-energy is 


1 G st t 
u=-| pddx= / PUP O py. (10.49) 
2 Jy 2 Jno Ix— yll 
while 
it of 
Vext = i pdexrd°x = c| PUG t)pextl¥ Bea (10.50) 
93 x (Ix —yll 


is the gravitational external energy; note that the factor 1/2 appears only in the expression 
of self-gravity for the reasons illustrated in Chapter 6. In Exercise 10.9, a very elegant field 
expression for U is derived. 

Fourth, remarkably (and unintuitively), another second-order tensor (which we call the 
interaction energy tensor) can be associated with the gravitational energy of the density p 
in the total potential dr as follows: 

dp 


Wi = -{ px; dx = Wij + WE", (10.51) 
i 73 Ox; 7 


where the meaning’ of W;; and wes follows immediately from ¢p = ¢ + ext. Therefore, 


Wr = Tr(W;;) = — / p(x, Vor)d?x = W + Woext. (10.52) 
: Diss 


Now, Wj; is related to U by a beautiful identity. In fact, in Exercise 10.10 we will prove 
that 
G (i — yi) — VP POSH pe(YsD) 
2 Ja6 Ix—ylP 


Wij = d°xd"y, (10.53) 


so that not only is W;; manifestly symmetric (a property not evident from its definition), 
but also 


U =TH(Wij) = -{ 
4 


Mx 


Wate Vo)d>x. (10.54) 


Again from Exercise 10.10, the student can convince themselves that, at variance with W;;, 
the tensor wes in general is not symmetric. 


7 Note how we is strictly related to the gravitational torque in Eq. (3.18), with Nj = —€; jx Wie 
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We can now move to derive the Lagrange—Jacobi identity in Eq. (6.10) (see also 
Exercises 6.4—6.6) in tensorial form, and from that the second-order tensor and scalar virial 
theorems for a stellar system possibly interacting with some other mass distribution 


12h; Wt Wes 1dr 
2 dt 2 ° 2d? 
This is arguably one of the most important results due to its numerous applications in 
theoretical and observational works in stellar dynamics. As the scalar identity is a trivial 
consequence of the tensorial identity obtained by its trace, and due to the fact that the asso- 
ciated virial theorems are obtained simply by considering the equilibrium configuration, 


our attention will be directed toward proving the first identity in Eq. (10.55). 
First, notice that from Eqs. (10.14) and (10.45) 


= 2Kij + Wij + =2K+W4+ Went. (10.55) 


dljj dp 3 OPUK 3 = 5 
7 = E Bp tixid x = -{, = xjxj@x= [. p (xj0j7 +xjUi)d°x, (10.56) 


where the last identity follows from integration by parts over x, and assuming the vanishing 
of the finite term at infinity. We now multiply the momentum in Eq. (10.15) by x; and we 
integrate over all of the configuration space: the two terms on the left-hand side become 


apt d 
/ go das | pxyd?s, (10.57) 
73 ot dt 73 
0 WD: 
i ip ee - owugd?x = —2Kix, (10.58) 
73 OX; 73 


while on the right-hand side we have Wi. A comparison of the identity obtained from 
symmetrization with respect to the indices with the time derivative of Eq. (10.56) finally 
establishes Eq. (10.55). 

In summary, we obtained the Lagrange—Jacobi identity in tensorial form for a stellar 
system described by the Jeans equations, and these equations in turn have been derived from 
the CBE (i.e., in a collisionless regime). Thus, it is legitimate to ask whether Eq. (10.55) 
holds only in these special circumstances. The answer to this question is provided in 
Exercises 6.4—6.6, where the Lagrange—Jacobi identity in tensorial (and scalar) form is 
proved for a system made of N point masses moving in their potential and in an exter- 
nal potential @ext by using the Newtonian equations of motion in Eq. (6.1). Therefore, 
Eq. (10.55) actually holds (with the kinetic energy tensor K;; = Tj; ) independently of any 
assumptions about collisionality. 


8 In Exercise 10.18, we elaborate on this point. In fact, the second-order mass tensor and the inertia tensors diverge for several 
density distributions commonly used in stellar dynamics, such as those in Eqs. (2.63), (2.83), (2.93), and (5.64), as well as 
many others that will be encountered in Chapter 13. For these systems, we will discuss the behavior at infinity of the discarded 
integrals and obtain the needed generalization of the Lagrange—Jacobi identity. 
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10.5.1 A Note about the Virial Theorem in Multicomponent Systems 


Particular attention should be paid to avoiding the incorrect (and unfortunately not 
uncommon) application of the SVT to multicomponent systems. Let us imagine arbitrarily 
decomposing a self-gravitating system of density p and potential ¢ into two parts with 
p = pi + p2 and d = ¢; + ¢2; due to the additive nature of kinetic energy and the 
mass tensor, it follows that K = K, + Kz and J = J; + J). From the second identity of 
Eq. (10.55), the SVT for the total system is 


F 
a= 2K +U, (10.59) 
where 


1 
U=5 | pbd'x =U) +024 Ui2 = Ui + Ua tau, (10.60) 
mn 


and we indicate with U; and U2 the gravitational self-energies of the two components, 
while 


Ui2=[ pidrd’x, Uri= fl prbid’x (10.61) 
Hie Hie 


are their interaction energies. Note that we also have Uj,2 = U2; from the reciprocity 
theorem, which is immediately proved from Eq. (10.50). The correct application of the SVT 
to each component is obviously 1/2 = 2K, + U; + W,,2 and Ih/2 =2K2+ U2+ Wo1, 
where 


Wi,2 = = [eux Vor)d*x W21 = -{ p2(x, Vo1)d>x. (10.62) 
Diss Diss 


An incorrect application of the SVT would be to naively assume that /, /2 = 2K,+U, + 
U,,2 (i.e., that in virialized multicomponent systems twice the value of the kinetic energy 
of each component sums up to the negative of the gravitational energy of the component), 
instead of the correct expression 2K; = —U, — W),2. The error becomes apparent if we 
proceed to add the two incorred relations, thus obtaining lee = 2K,;+2K2+U,;+U2.+ 
Ui,2 + U2,1, with twice the contribution of the mutual gravitational energy when compared 
with the correct Eq. (10.60). Obviously, by adding the correct expression we recover the 
correct tensor virial theorem, and the student can prove this by using the same approach in 
Exercise 10.10 to show that 


Wi,2 + W211 = U21 = Uj,2. (10.63) 


10.5.2 The Potential Energy Tensor for Ellipsoidal Systems 


We conclude this chapter, giving the expression for the tensor W;; of ellipsoidally stratified 
density distributions p = o,0(m) as defined in Eq. (2.28). The resulting formula, albeit 
limited to this class of models (which certainly do not encompass the variety of structures 
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shown by real stellar systems), has been of great importance for our understanding of ellip- 
tical galaxies, particularly for proving that in many cases elliptical galaxies are not properly 
modeled by classical ellipsoids flattened by rotation (Chandrasekhar 1969), but rather are 
systems whose shape is also determined by anisotropy of the velocity dispersion tensor 
(e.g., see Binney 1978; Binney and Tremaine 2008 and references therein). In addition, 
other important quantitative information can be obtained from W;;, particularly in relation 
to stability problems (e.g., see Bertin 2014; Chandrasekhar 1969; see also Exercise 13.33). 

In fact, it can be proved (see Roberts 1962) that W;; = 0 fori # j, while the diagonal 
terms can be obtained from the quite remarkable formula 


CO 
Wii = —Gma\ana347 w; i [AW(m)]° dm, (10.64) 
0 


where no summing over repeated indices is intended, AW (mm) is defined in Eq. (2.22) with 
Xo = © and m(x,t) = m, and the dimensionless coefficients w; are given in Eqs. (3.12), 
(3.13), (3.62), and (3.63). Notice how the ratio of the two elements W;; / W;; is independent 
of the specific density profile of the ellipsoid and only depends on the axial ratio! Now, from 
Eq. (10.53), it is easy to prove (do it!) that for obvious symmetry reasons the off-diagonal 
terms of the potential energy tensor vanish identically, and in Exercise 10.19 we prove by 
direct computation that Eq. (10.64) in fact gives the correct result in the special case of the 
constant-density ellipsoid. 

Finally, with the aid of the expansion in Eq. (3.13) for small flattenings and some careful 
algebra, we can prove (do it!) that at the linear order in the flattenings 


GM? oo 
U = Wir + War + Was ~ — (: ae 5 “) f [AW(m)|2dm, (10.65) 
a) 0 


where AW = AW / Pn is the function associated with p in Eq. (2.28). As a useful exercise, 
the student should prove the result in Eq. (10.65) by direct integration of the formulae 
obtained in Chapter 2 regarding homeoidal expansion up to the linear terms in the flatten- 
ings for the product pd@. 
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10.1 In the so-called extended DF modeling, stellar systems can be described as a sum 
of a finite number of different DFs (e.g., distinguished by stellar age, metallicity, 
kinematical properties), or even as the integral over some parameter of a DF depend- 
ing on the parameter itself (e.g., see Chapter 8 for some examples). Prove that in a 
multicomponent system described by a discrete set of DFs f;, the total macroscopic 
quantity F is given by the mass weighted sum of the values of each component, i.e., 


(10.66) 
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10.2 


10.3 


10.4 
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Then prove that the total streaming velocity and total velocity dispersion tensor are 
expressed in terms of the analogous quantities for each component fi as 


we LAPD fn _ VPRO DE | Dy mee — WILT — V 
eee Dk Pk yx Pk 


(10.67) 


Expand and further simplify the last term in the expression for Of conclude that 
in multicomponent systems we can have a nonzero velocity dispersion even if the 
velocity dispersion of each Agaanes is zero and explain physically the result. 
Hints: In the expression of oj; , rewrite each term of the sum over the different phase- 
space components fx as (vj — Tj Vi)(v; — Vi) fe = Vi -— M+ We —Vllvj — WP e+ 
(Vj )x — Uj] fx, expand and perform integration over the velocity space. 


Derive the general Jeans equation in curvilinear orthogonal coordinates and veloci- 
ties (q, v), analogous to Eq. (10.8). First, consider a macroscopic function F(q, v;t) 
and a phase-space DF f(q,v;f), and from Eq. (10.1) define the quantity F'(q; 1), 
recalling that at fixed q, d*v = dv (see Exercise 9.2). Second, repeat the treatment 
in Section 10.2 for the CBE in curvilinear coordinates in Eq. (9.20), with f’ = f. 
Finally, consider the common case of F'(v; ft) and show that 


= 3 — 3 os 3 — 
d0F 1 00 F 1; Oj jk OF V; iV p 0dr OF oF 
Ot 12, gi hj yas jy, OF 
i=l i,j.k=l pag eh ae 
(10.68) 


Show that for a fluid of density p, the identity 


d ' Dh 
S| aha x= | (0 Fre mn) ax (10.69) 
dt Jaw) Q(t) Dt 


holds, where h is a generic property (per unit mass) of the fluid. Hint: First apply 
Reynold’s transport theorem, use Foonote 6 in Chapter 9, and finally use Eq. (10.18). 
With this exercise, we pay due hommage to Archimedes (287-212 BCE). First, 
from Eq. (10.69), in the absence of sources/sinks, show that for a fluid with density 
distribution o and mass M = Ie (t) pd>x over some region Q(t), the acceleration of 
the center of mass is given by 


2 
MAcm = ‘ / pxd°x = a pud?x = i p ear (10.70) 
dt? Jew Qt) aa) Dt 
Then consider Eq. (10.19) in the hydrostatic case and in the absence of sources/sinks, 
and further restrict yourself to the case of a uniform gravitational field; from the 
divergence theorem, prove that the total force enacted by surface (pressure) forces 
on dQ is the opposite of the weight of M. Finally, suppose that we substitute the 
fluid in the volume with a body of the same shape and deduce Archimedes’ law of 
hydrostatics. 
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10.5 Prove the validity of Eqs. (3.14)-(3.16) for a generic fluid system described by Eqs. 
(10.18) and (10.23) in the absence of source/sink terms. Hints: To prove the first 
identity in Eq. (3.14), assume p = px, in Eq. (10.70) and Q(t) = x3. Then use 
Eq. (10.23) in the absence of sources/sinks and consider p;; = 0 for ||x|| — oo. 
Regarding the volume forces, a symmetrization procedure analogous to that used to 
prove Eq. (10.53) shows that the integral contribution of the self-force is zero from 
Newton’s Third Law of Dynamics, so that the only remaining contribution to the 
integral is due to the external field g = —V@ext, and this completes? the proof. For 
Eq. (3.16), again from Eq. (10.69) in the absence of sources/sinks applied to the first 
identity in Eq. (3.15), one proves immediately the second identity, where 


Du 3 
N= (x — xo) A — pdx. (10.71) 
73 Dt 


Again use Eq. (10.23) in the absence of sources/sinks: integration by parts and the 
Cauchy symmetry of p;; proves that the contribution of surface forces vanishes. 
Moreover, symmetrization of the integral involving self-forces again shows that the 
contribution is zero, leaving external forces g as the only contributors to the torque, 
completing the proof. 

10.6 This exercise is relevant for the comparison of the Jeans equations and the equations 
of fluid dynamics. Starting from the microscopic function F = ||v||?/2, prove that 


2 2, == 
Iv? Tro) OF 
2 2 ° av; 


2 = =r = 
_| Tr) (WII (v; — %) lv — VII? 
5) + > x= . 


F= 


= (10.72) 


2 


Fu = xi + og0j + Ui 5 (10.73) 
10.7 By using Eq. (10.30), show that for a perfect gas at equilibrium in a spherical 
potential @r(r) the temperature and density profiles of the gas are related to the 
total enclosed mass My(r) via Newton’s second theorem as 
kpT(r)r?2 [dinp(r)  dinT(r) 
= : 
(L)Mp dr dr 
10.8 With this exercise, we elaborate on the energy balance of idealized, stationary gas 
flows in galaxies. Show that in the stationary case Eq. (10.41) can be rewritten as 


Mr(r) = (10.74) 


2 2 Ty2 aT 2 
£ = —div [ns (a+06+ 0 )ul+u(erS+o+ 5h > 
(10.75) 


where H = E+ p = Eyad = PYad/(Vaa — 1) is the enthalpy per unit volume of 
the gas and h is the heat conduction vector. Hints: Use Eqs. (10.18) and (10.40) to 
obtain the expression for (u, Vp). 


2 Note that the same result holds if Q(t) is finite and Pij = 0 over 0Q(t) vanishes. 
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10.9 


10.10 


10.11 


10.12 
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Let ¢ be a self-gravitating density distribution of total mass M defined over a region 
Q bounded by 0&2 with normal n and vanishing outside. By using the Poisson 
equation Ad = 4 Gp and the identity (verify it!) 


PAG = div (¢ grad) — ||grad |, (10.76) 


prove that Eq. (10.49) can be rewritten as 


1 
-aa f |Vpl|?d°x + ao fo (Von (10.77) 


(see also Jackson 1998). Moreover, if 0&2 is an equipotential surface, prove that the 
surface integral evaluates to #(9Q)M /2, and finally that if Q = 9° and @ > 0 for 
||x|| — oo, then the surface integral vanishes. 

Prove that the self-interaction tensor W;; can be written in the manifestly symmetric 
form of Eq. (10.53). Hint: From Eq. (10.51) 


| Xi (xj — y/) P(X 1 ply; t) 
Wi j= 3 

m3 IIx — yll 

and the exchange of x and y obviously does not affect the value of the integral. By 
adding the two expressions, we obtain the desired result. Note that the exchange of 
x and y does not affect either Wess however, in general, Wr i is not a symmetric 
tensor due to the fact that, in general, o 4 (ext. What happens if 0 = ApPext? 


d°xd°y, (10.78) 


Generalize Eq. (10.53) for a system of total mass M governed by additive r~® forces 
as in Eq. (1.13) and show that 


(w@—1)Ug, a #1; 
Wa=) GM? 
a 
where Uy is the self-energy obtained from Eq. (2.126) under the assumption that 
¢(xo) = 0 and xp = & fora > | and xp = 0 fora < 1. Compare the result with 
that in Eq. (6.44) for a system of point masses and explain why W, in Eq. (10.79) 
is not coincident with W in Eq. (6.42) with A = —G, B = 0, anda = 1. Hints: 
Consider the different role of the self-interaction energy for the point masses in 
Eq. (6.41) and for the density elements in Eq. (10.53). 
Consider the deep modified Newtonian dynamics limit in Eq. (2.123) for a system 
of total mass M. Show that 


(10.79) 


a=l, 


2 
GaoM3, w= 3 GayM (10.80) 


independently of the shape of the system so that from the Lagrange—Jacobi identity 
in Eq. (10.55) the changes of the moment of inertia of the system are only associated 
with changes in the kinetic energy, in analogy to the case of the a = | force 
in Eq. (10.79). Hint: Consider Eq. (10.52) for the potential @)4 and express the 


10.13 


10.14 


10.15 
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density distribution in terms of the potential from Eq. (2.123). Following Nipoti 
et al. (2007), show that the integrand can be rewritten as 


3 
avout | (10.81) 


D(gm) div (||Vom ll Vom) = div | Peon Von Von = 


where D = (x,V). Finally, use the divergence theorem for the integration volume 

extending to infinity and evaluate the surface integral from Eq. (2.125). 

Consider a razor-thin disk as in Eq. (1.18) with a surface density &(R) so that the 

self-gravitational energy U converges. From Eqs. (10.49) and (10.54), prove that 

0¢(R,0) 
R 


v=nfo E(RP(RORAR = 20 x(R) R?dR 
0 0 


= —27 [ X(R)uz(R)RdR = —207°G [ bwra. (10.82) 
0 0 


The student is encouraged to prove the second identity without resorting explicitely 
to the expression of W. Hint: The last integral in Eq. (10.82) is immediately estab- 
lished by inserting the first identity in Eq. (2.105) evaluated at z = 0 to the first 
integral, inverting the order of integration, and evaluating the resulting Hankel trans- 
form. 

Consider a galaxy with a density distribution p,.(x) and total mass M,, hosting at its 
center a supermassive BH of mass Mpqy and potential gy. We focus on the stellar 
component so that @gH acts as an external potential. Show that, independently of 
the galaxy shape, 


ase i pa, Voon) x = / _ px gund?x = Use, (10.83) 
HR? HP 


therefore in this very special case Wext = Uext. 

Consider again the stellar system of Exercise 10.14, now embedded in an exter- 
nal potential produced by the ellipsoidal generalization of the singular isothermal 
sphere in Eq. (2.63), which with abuse of language we call a singular isothermal 
ellipsoid 


PSIE = (10.84) 


qyqzm?" 


where we follow the convention in Eqs. (2.26)-(2.28). By using Eqs. (2.21) and 
(2.22), show that, independently of the shape of px, 


Wsie = — / _ Px (& Vosie)d?x = 27 Goya} wo(0)Ms = —veMx, (10.85) 
Diss 


where wo(0) is given in Eq. (2.67) and the last identity holds for the axisymmetric 
case (qy = 1), with ve being the constant circular velocity in the equatorial plane of 
PSIE, as obtained from Eqs. (5.64) and (5.65) evaluated for y = 2. 
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10.16 Show that for a generic density distribution p, embedded in an external triaxial 
logarithmic potential 


biog = veInm, Wig = —v2M,, (10.86) 


where v2 


the axisymmetric case) and m is defined as in Eq. (2.8). Convince yourself (by direct 
calculation or by using the result from Exercise 2.9) that the potential of the singular 
isothermal ellipsoid is not ¢jog, even though in the equatorial plane they coincide in 
the axisymmetric case (prove it!). What happens at the spherical limit? Conclude 
that in the last case Eqs. (10.85) and (10.86) concide with Wsis of o, embedded in 
the external potential of the singular isothermal sphere of rotational velocity v¢. 

10.17 Consider a stellar system with density p, and streaming velocity field v;. Then 
consider an arbitrary volume (2(t) of the system, transported by the streaming 
velocity. Finally, let Q(t) be the boundary of the region and n its unit normal. 
Starting from Eqs. (10.14), (10.16), and (10.45), show that 


is a scale potential (reducing to the constant squared circular velocity in 


I 
ao 2K+W+Wer—-P, P=Tr(Pij), (10.87) 


where all of the quantities are intended to be evaluated! over Q(t), and in particular 
1 2 2 2 
Pij = 5 ae Pa (OjpXj + Oj _Xi )NKAX. (10.88) 
If px0;, is constant on 0&2, show that 


P = Volume[Q(t)] x pxTr(o;;) (10.89) 


IQ) 
Obtain the fomulae for the case of a system with null streaming velocity and a 
fixed control volume and derive the SVT for a fixed region Q in a stellar sys- 
tem at equilibrium (see also Exercise 10.18). Notice that the case in Eq. (10.89) 
happens, for example, in the case of a spherical region &2 centered on the center of 
a spherical stellar system such as those we will encounter in Chapters 12 and 13 
(see Exercise 13.21). An important application of the virial theorem with boundary 
terms can be found in section 8.2 of Binney and Tremaine (2008). Hints: Use 
Reynold’s transport theorem (Appendix A.10) with v; as the velocity field to com- 
pute the second-order time derivative of J;; defined over the region (2(r) by using 
Eq. (10.69). From Eqs. (10.14)-(10.16) and the divergence theorem applied to vol- 
ume integrals of quantities such as X49 (Px) /Oxk = 0 (px05,Xi) /OXk — P x07. 
obtain Eqs. (10.87) and (10.88), the generalizations of Eq. (10.55). For Eq. (10.89), 
again use the divergence theorem to show that 


10. Of course, the potential used to compute W also contains the contribution from the part of the system outside Q(r). If the 
system is included and 0Q(t) is the boundary of the stellar system, then W = U. See also Exercise 10.18, where the inner and 
outer contributions are made explicit. 
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i xjngd°x = Volume[Q (1) ]dix. (10.90) 
AQ(t) 


10.18 As anticipated in Footnote 7, the validity of the Lagrange—Jacobi identity as a tool to 
prove the virial theorem can be legitimately questioned in the case of density profiles 
with a divergent second-order mass tensor. With this exercise, we elaborate on this 
point. For a given density profile p, fix a time-independent closed control volume (2 
and define p' and p™ as the densities inside and outside Q, respectively. Repeat 
the treatment in Section 10.3 and derive the generalization of Eq. (10.55) for the 
density p'". In particular, show that for time-independent systems 

out out ext ext 
al Dx (x;0j0E + xj WUE) @x= AY an wi +4 Wij a We We _ ge 
aQ , 


2 
(10.91) 


where weet and wre are the interaction energy tensors of p'" due to the potentials 
go and ¢', respectively. The standard virial theorem can be recovered case by 
case, showing the existence of the limit of the various quantities for 2 — oo (see 
again Exercise 13.21). 

10.19 Verify with explicit integration that for the constant-density ellipsoid obtained from 
Eq. (2.74) with p(m) = po, the self-interaction tensor W;; in Eq. (10.64) agrees 
with the expression in Eq. (10.51). Hints: Compute the integral in Eq. (10.51) with 
the potential #(x) at the interior points of the constant-density ellipsoid obtained 
from Eq. (2.75) and the potential derivatives given by Eq. (2.72) and integrate over 
the ellipsoidal volume. Evaluate Eq. (10.64) with AW(m) = 2 i to(t)dt from 
Eq. (2.75) and compare the results. 

10.20 We now use the SVT to estimate the average orbital times (Porb) of stars in a self- 
gravitating ellipsoidal galaxy in the limit of small flattenings. Of course, our defi- 
nition is necessarily arbitrary (but not unreasonable), and several others analogous 
definitions can be easily imagined.!! The point here is to obtain a robust order-of- 
magnitude estimate based on the virial theorem. First, prove from Eq. (6.26) that 


bP sie Ary _ 4GM,!? = 64/2 1 €x + Nx ay (10.92) 
ay ~«C SC 2 GM, 


where U = [oo (aw (m)|*dm; explain physically the dependence of (Po;b) on the 
flattenings. Second, evaluate U for the ellipsoidal generalization of the y-models, 


the isochrone model, the perfect sphere model, and the Plummer model and in the 
spherical limit verify the equivalence with the values of U calculated directly for 
the parent spherical models in Exercises 13.6, 13.7, and 13.17. 


1 For example, the student is invited to compute (and compare) the periods of radial and circular orbits in truncated power-law 
spherical systems such those in Eq. (13.29) for different values of the density slope y. 
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10.22 


10.23 


The Jeans Equations and the Tensor Virial Theorem 


Use Exercise 3.10 to estimate the libration frequencies of an elliptical galaxy at the 
center of a nonspherical cluster and compare them with the characteristic stellar 
orbital times inside the galaxy obtained in Exercise 10.20. For the galaxy, consider 
a y = | ellipsoidal model with «, = 0.1 and n, = 0.2 and physical scales 
M, = 10'!'Mo and a; = 5 kpc. For the cluster, consider € = 0.1 and 7 = 0.2 in 
Exercise 3.10 and estimate p(0) of the cluster from representative values of 
Meuster = 103 Mo and a scale length of 100 kpc. 

Consider a self-gravitating stellar system of density distribution p,.(x), gravitational 
potential ¢,.(x), and finite total mass M,,. The local escape velocity is given by 
Vesc(X) = ./—2¢, (x). Show that the mass-averaged escape velocity is related to the 
virial velocity dispersion of the system by the remarkable identity 


2 1 2 3 2 
(Vago) = vm o P(X) Ug. (X)d°x = 4oy. (10.93) 
* JSS 


This identity plays a relvant role in the study of the gravitational evaporation of 
collisional systems such as globular clusters (e.g., see Chapters 6 and 7; see also 
Bertin 2014; Binney and Tremaine 2008; Chandrasekhar 1942; Spitzer 1987). Hint: 
Use Eq. (10.49). 

Consider a spherical isotropic stellar system of density profile p,.(7) and total stellar 
mass M,,. Determine the expression for the total potential @r(r) so that the system 
is isothermal (i.e., o;(7) = Oo, an assigned constant value) and show that the 
associated total density profile is given by 


2 
o5 d| 5dlinp,(r) 
= 10.94 
ma) as | dr | “o 
and that 
309 M, 
co (10.95) 


Finally, assume that an isothermal (perfect) gas distribution p of temperature 7p is 
at hydrostatic equilibrium in the total potential and show that the density profiles of 
the gas and stars are related as 


umpoe 
p(r) =A x [px(r)] B , (10.96) 


where A is a constant (e.g., see Cavaliere and Fusco-Femiano 1976). Hints: Use the 
isotropic limit of Eq. (13.82) and the spherical limit of Eq. (10.30). 
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Projected Dynamics 


When observed as astronomical objects, stellar systems appear projected on the plane of the 
sky. As a consequence, it is important to set the framework for a correct understanding of 
the relation between intrinsic dynamics and projected properties. Unfortunately, while it is 
always possible (at least in principle) to project a model and then compare the results with 
observational data, the operation of inversion (i.e., the recovery of three-dimensional infor- 
mation starting from projected properties) is generally impossible due to obvious geometric 
degeneracies. Spherical and ellipsoidal geometries are among the few exceptional cases that 
will be discussed in some depth in Chapter 13. Here, instead, the reader is provided with 
some of the general concepts and tools needed for projecting the most important properties 
of stellar systems on a projection plane. 


11.1 The Projection Operator 


Let us consider a given component of a (possibly) multicomponent collisionless stellar 
system described in an inertial orthogonal reference system So = (O;e1,€2,€3) by its 
density 9 = p(x;f). In So we introduce the orthogonal (time-independent) observer’s 
system S’ = (O’;f),f2,f3), and we assume that O = O’ at all times. Throughout this 
chapter, the repeated indices sum convention is used, so that x = xje; = xje) +x2€2 +x3€3 
is a vector in Sp and & = &;f; is the same vector in S’. In S’, the observer’s direction is that 
of f3 (i.e., the observer’s “eye” is placed at infinity on the &3-axis of S’): this direction is 
usually referred to as the line of sight (hereafter Jos). 

We do not use the standard Euler angles! to specify the orientation of S’ with respect 
to Sg, but instead we apply a 3-2-3 rotation (see Figure 11.1). In this way, the first 
(O < @ < 27) and second (0 < # < 7) rotation angles coincide with the polar coordinates 
of f3 in Sg, and e3 and e; are the polar and azimuthal axes, respectively. It follows that the 
positions and velocities in the two systems are related as 


x=Ré V=Rv, R=R3(~P)RIV)R3(W), (11.1) 


! Starting from two coincident systems, the Euler angles are usually defined by a 3-1-3 rotation (i.e., by a rotation around e3 
(gy), then around e (#), and finally around ef (y)). As a result, the angles y and # are not the common spherical angular 


coordinates of ey in So. 
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Figure 11.1 The three successive rotations of angle y around x3 (with xj > x4)s then of # around 


x4 (with x} > x/), and finally of y around x4 (with x/ — &}) needed to define the total rotation 


matrix R between Sg and the observer’s system S’ (see Exercise 11.1). 


where v is the velocity as seen from S’, and the rotation matrix R is derived explicitly in 
Exercise 11.1. As a consequence, the /os direction in So is given by the unit vector n, with 


n= Rf3, n= (cosgsinv, singsinv, cos?). (11.2) 


Notice that from the adopted definition n points toward the observer. Therefore, for a given 
position x, the corresponding &3 coordinate is given by 3 = (n,x). We call the projection 
plane the plane z’ containing the origin and perpendicular to n. This is given algebraically 
by &3 = 0, and it can be identified with the set of two-dimensional vectors € | = (&1,&2); 
similarly, we define the velocity vector perpendicular to the Jos as v, = (14, v2). It should 
be clear that the last rotation of angle y around the Jos simply reorients the projection 
plane; in other words, the projection is intrinsically determined only by the angles ¢ and 2. 
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As we have seen in Chapter 9, all of the dynamical properties of a stellar system are 
known if its distribution function f = f(x,v;t) is known. In So, this function obeys the 
collisionless Boltzmann equation (9.9), and from it all of the moments? can be obtained. In 
S’, the phase-space distribution function (DF) is given by 


f'E.v3t) = f (RE, Rv; 1). (11.3) 


Moreover, as has been discussed in Chapter 10, every macroscopic function F(x) is defined 
by integration on the velocity space of its microscopic counterpart F'(x,v), using f as a 
weight function, and the resulting differential equation for F is given in Eq. (10.8). In 
the observer’s reference frame S’, the microscopic function F’ is given by F’(&,v) 
F(R&,Ryv). The student is invited to prove as a useful exercise that p’F’ = ff’ F’d?v = 
(f f Fay)’ = (pF) = p’F by using the identity d*>v = dv due to the orthogonality 
of R. It follows that under our assumptions the macroscopic properties in the observer’s 
system S’ can be obtained from the corresponding property in So noticing that (prove it!) 
F=F. 

The mass-weighted projection of a given property F = F(x) onto the projection plane 
zc’ is naturally defined as 


EE, )Fios(é 1) = / p' (§)F (&)dés, (11.4) 


where 


xé))= / p (&)dé3 (11.5) 


is the projected density of the system. It is important to note that the definition of projection 
above assumes implicitly that the observer is at infinite distance from the system (as all 
points are projected parallel to n) and that the system is “transparent” (i.e., all of the 
information along n is summed in the projection plane with no superposition of two stars 
along the Jos; see Exercise 11.2). 

We are now ready to present a simple but important result about projection: that for 
arbitrary function F, 


[ 2evFin€ vies = [ pooF ova’. (116) 


The proof can be obtained by inserting Eq. (11.4) into Eq. (11.6), changing the variables 
according to Eq. (11.1), and using the orthogonality of 7. In particular, the result means that 
the mass-weighted surface integral of any projected property equals the (mass-weighted) 
volume integral of the spatial property, and of course this is independent of the /os direction. 
Quite trivially, by assuming F = 1, one obtains that the surface integral of the projected 


2 For simplicity of notation, in the following discussion, the possible dependence on time of the phase-space averages over f 
will not be indicated. 
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density equals the total mass associated with the density distribution p independently of the 
particular orientation of the observer: 


), DE, )d°é | = he p(x)d°x. (11.7) 


Finally, an identical result holds if the mass density p is substituted with the light density 
v when all averages in Eq. (11.6) are intended as luminosity weighted; of course, in the 
case of a constant mass-to-light ratio Y,., where p(x) = Y..v(x), the mass-weighted and 
luminosity-weighted averages coincide. In Chapter 13, we will encounter some illustrative 
cases of projection by considering ellipsoidal (and in particular spherical) systems. 


11.2 Projected Velocity Moments 


In Chapter 10 it was shown that a special status within the infinite family of microscopic 
functions F that can be constructed is assumed by the velocity monomials of any order. In 
fact, these functions enter directly into the virial theorem, and here we take F' to be the N-th 
power of the v component along n. Therefore, we introduce the concept of a phase-space 
velocity field of order N as 


vy’ (v) = (n,v)% = (njv;)” 


22 (i) (7) (ng Eng Say yoy "vy Pvp), 11.8) 


where N > 0 is an integer? and (3) is the standard binomial coefficient in Eq. (A.48). With 
oN (x) we indicate the mean over the velocitiy space of oF (v), and we call this quantity 
the N-th-order projected velocity field along n at position x; according to Eq. (11.4), 
its integration along the Jos is the N-th-order los projected velocity field UN oe (€ |). The 


phase-space average and integration along the Jos lead to similar expressions for oN (x) 


and UW (8.1), with the monomials i ae substituted by (oa) and 


(v3 Ss i I ag respectively. In Summary, uy (v), its average over the velocity space 


of (x), and finally its Jos projection of o(E ,) can all be expressed as linear combinations 
of the corresponding velocity monomials of order N in So, with coefficients depending on 
the /os direction, A function of great importance in observational and theoretical works is 
obtained for N = 1 


Up(x) = (LV) = nv. (11.9) 


Up(x) is called the projected streaming velocity field along n at position x. By construction, 
a positive Up means that the component of the projected streaming velocity along n points 


3 ForN =1, Up = (n,v) = (Rf3,Rv) = v3 from the orthogonality of R. 
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toward the observer and the integration along the Jos of Up represent the los projected 
streaming velocity field Up,,,(& ,.). Note that if the system does not have intrinsic streaming 
motions (i.e., 0; = 0 fori = 1,2,3), then Upiog = Up = 0, but the converse is not true: for 
a given orientation of the Jos, the quantities Up and Up,,, may vanish, but the system can 
still possess streaming motions. Moreover Up), can vanish even if Up # 0. The student is 
encouraged to imagine some illustrative cases of these possibilities. 

Another important family of functions is naturally introduced: the phase-space velocity 
dispersion field of order N 


N 


on (Vv) = (Up — 0p)" = ae Je Poy as (11.10) 


K=0 


With oN (x) we indicate its mean over the velocity space, and we call this quantity the 
N-th-order projected velocity dispersion field along n at position x; its integration along 
the los is the N-th-order los projected velocity dispersion field on. (& | ). In practice, on 
“measures” the dispersion of stellar motions at point x in the direction n with respect to 
the local average value of Up along the same direction, and oN. “measures” the mass- 


weighted (or ee integral of these dispersions along the Jos. Similar formulae 


for on (x) and os gah ,) are obtained from Eq. (11.10) where the terms a x Up are 

replaced by k 7K and @i-* 5, Up 19s, respectively. Of course, if Up Up = 0, then o, qf = = 
oN = yN = = 

ile ai ie ea ‘s eG al 

o2(x) = njnjo;,, (11.11) 

where oF; is the velocity dispersion tensor introduced in Eq. (10.4); its projection is 


‘eee ,). It should be appreciated that in general oN oF P los is not the N-th “observed” los 
ae dispersion field. In fact, the observations give the Jos velocity dispersion centered 
on the /os streaming velocity field Up,,. (€ |), and not on the local 0) (x) as in Eq. (11.10); 


nonetheless, as we will see, on. (é ,.) is an important projected field. 
These considerations lead us to introduce a third and last function over the phase space: 
the phase-space N-th-order local los velocity dispersion field 


N 


N 
oi" (v) = (Up — 0p,,,)% = > (Cah * Oa (11.12) 
K=0 


As with the previous quantities, with oN (x) we indicate the mean over the velocity space of 
V(v) atx, and finally the N -th los velocity dispersion field is given by oy 46 (€ .)- Explicit 


expressions for 0; N(x) and 0; OW jog (E ,) can be obtained by substitution into Eq. (11.12) of 


the corresponding quantities on and a ae respectively; of course, for N = 1, 
Ollos = 0 always. Note that if the system is characterized by Up,,, = 0 for some & , then 


202 Projected Dynamics 


N _ WN GN — aN 7 a ; 
Oy = Up and o',,, = Up jos” Moreover, if Up = 0 all along a given lost through & | , not 
only do the previous simplifications hold, but in also ay = on and a los = a ae In 


particular, the last identity holds everywhere in the projection plane independently of the 
direction n for stellar systems with a vanishing streaming velocity field. 
Before proceeding with the discussion, a comment is in order. As we have seen, the fields 


ay and of (and the associated Jos fields) can be expressed as functions of the fields ok 


(and their Jos projections). Is the opposite true? This is a very important question because 
from an observational point of view we can assume that we have access to the members of 
the family of los Oly, together with v. jos’ The following results provide the answer to this 
question. In fact, for N > 0, we can recast Eq. (11.8) as 


N N 


oo). (ae =) (K) @iea"* or, (11.13) 


K=0 K=0 


and the associated expressions for uy and uy los ate obtained by considering velocity 


averages and projections. Similarly, Eq. (11.10) can be rewritten as 


N 


N eK 
oNwy= >> (J )cokel ae ee (11.14) 
K=0 
and the expressions for oN and a. are obtained by inserting in Eq. (11.14) the terms 
sa aK sa aK 
ov ie 3! and (oj me 3] Jios, fespectively (see Exercise 11.3). 


In the vast majority of theoretical and observational works, the second-order moments 
play a special role. From the phase-space average of Eq. (11.13) with N = 2, 


=o2 +), (11.15) 


because trivially of = 0, and from the phase-space and Jos projection of Eq. (11.12) with 
N=2, 


De. ge — \2_ 79 —2 — 2 
O71 los = “Dios — (pjos) = %D los + UPlos — (Upjos) , (11.16) 


where the second identity follows from the first and from the /os projection of Eq. (11.15). 
Finally, notice how from Eq. (11.16) one can prove the simple but important identity oy logs 


(pj, ae = o?,.. + Ce sometimes (e.g., Cappellari 2008) this latter quantity is indicated 


in the literature as eee 


Two important conclusions relevant to obervational works can be derived from Eq. 
(11.16). The first is that Oo; os Can be greater than zero even in an idealized stellar system 
with no intrinsic velocity dispersion (and so o2,.. = 0), as for example in the edge-on view 
of a “cold” stellar disk where all stars circulate around the disk’s center in the same sense. 


The second is that in fully pressure-supported stellar systems (i.e., 0; = 0 everywhere), 


oe = v2 =o2 .In Chapter 13, we will consider in detail the construction of 
1 los Plos P los 


11.3 Velocity and Line Profiles 203 


the first- and second-order projected velocity fields for the most simple cases routinely 
encountered in stellar dynamics. 


11.3 Velocity and Line Profiles 


A natural question with obvious implications for theoretical and observational works arises: 
How are the fields introduced in the previous section related to the distribution function f’ 
referred to in the observer’s frame of reference S’? We start by introducing the concept of 
a velocity profile (VP), a function defined at the generic point € | in the projection plane as 


XE, )VPE, up) = [, f'(&,v;t) d°v 1 dé, (11.17) 


where we use the fact that vp = v3 (see Footnote 3). In practice, the VP gives the distribu- 
tion of the velocity component vp of all stars along the Jos, obtained by integrating the DF 
over all of the velocities vy;  n and then summing along &3. By definition, the physical 
units of the VP are those of an inverse velocity; moreover, f VP dup = ( le. dé3) /x=1 
(i.e., by construction, the integral of the VP over —co < vp < ov is normalized to 
unity (prove it!)), and of course in stellar systems the VP vanishes for sufficiently high 
values of |p|. 

It is a simple but useful exercise to show that two important projected fields that we 
discussed in the previous section can in fact be obtained from the VP, i.e., 


i Up VP(E 1, Up)dUp = Thigg(E 1), (11.18) 


and 
CO N se, 
/ (Up — Vpigg)” VP(E 1, Up)dup = 0, (E1)- (11.19) 
—CO 


The proof (do it!) simply consists of substituting the definition of the VP into Eqs. (11.18) 
and (11.19) and performing the resulting (formal) integrations. 

We now move another step toward the observational world by linking the VP to another 
extremely important quantity associated with spectroscopic observations of galaxies when 
the spectra of single (in general unresolved) stars are “summed” (luminosity weighted) 
along the Jos. This function is the so-called line profile (LP; i.e., the function of the wave- 
length A describing the shape of the cumulative spectrum obtained by the superposition of 
the red-shifted and blue-shifted lines of all of the stars along a given /os direction n and 
at position & | in the projection plane). Of course, even a partial treatment of the technical 
aspects of the problem is well beyond the introductory nature of this book (e.g., see Binney 
and Tremaine 2008), yet it is important to present the basic ideas of the subject. 

Let us assume for simplicity that the stellar population of our system is homogeneous 
with a position-independent mass-to-light ratio Y,,, and that, when observed at rest, the 
spectral line of interest produced by a given star is described by the normalized function 
of wavelength 2 Py = Po(A), with i Po(A)di = 1. The physical dimensions of Po are 
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therefore the inverse of the length. In the resulting LP, each star contributes the profile Po, 
red- or blue-shifted according to the sign of vp. As a result of the assumed orientation of S’, 
a negative Up (a star receding from the observer along the Jos n) corresponds to a red-shift, 
and so for |vp| < c (where c is the speed of light), a given ) in the spectrum of the star is 
shifted in the observer’s system to A’ = A(1 — vp/c). It follows immediately (prove it!) that 
the normalized shifted spectral LP as seen in S’ is 


rn 1 
Py) = P : 11.20 
v) (i) 1—vp/c ‘ ) 
Therefore, by summing the spectral contributions of all of the stars along the /os, the 
resulting LP is 
e Xr VP(é |, 
LP(é | ,A) =| ro( ) Ga a (11.21) 
268 1—vp/c/ 1—vp/c 


where of course no problems may arise from the formal integration over velocities due to 
the vanishing of the VP at high velocities. Note that from the normalization of the VP in 
Eq. (11.20) and of Po, the LP is also normalized to unity, with i LP did = 1. The simplest 
and idealized case of the LP is given by a perfectly monochromatic line, when Pp is simply 
described by a Dirac 5-function (see Exercise 11.4). 

Now let Ap = (ie Po(A)AdxX be the characteristic wavelength (in the laboratory rest 
frame) associated with the Po profile and let of = is Po(A)(A —Ag)*da be the measure of 
the line width. The same quantities can be associated with the “observed” LP, and in fact 
at every place in the projection plane (see Exercise 11.5) 


00 Vv 
we / LP(A)AdA = (1 = “tu: do, (1122) 
0 G 
00 M,.\2 oe, 
oS [ LPO =2,7ak = (1 _ “rs ) ag 2 a +07), (11.23) 


The identities established here show how from the knowledge of Ao and oo of the spectral 
line of interest and of the observed values of 4, and oy, it is possible to measure Up,,, and 


OF tos at each point in the projection plane! Of course, increasingly complicated relations 
between higher-order moments of the LPs and higher-order projected kinematical fields 
can be established following the same approach as above by considering quantities similar 
to those in Eq. (11.23) evaluated for (A — Ay). In summary, the LPs are natural tools 
that connect theory to observations, and it is quite obvious that a substantial amount of 
theoretical work has been done to develop methods that are able to extract information from 
the observed LPs, with important results concerning the possibility of measuring orbital 
anisotropy or the amount of dark matter in a galaxy. We cannot enter into more details here, 
nor can we give even a partial list of the relevant references, but the student is encouraged to 
explore and study the literature concerning this fascinating subject (e.g., see Carollo et al. 
1995a,b; Ciotti and Lanzoni 1997; Dehnen and Gerhard 1993; Gerhard 1993; Magorrian 
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and Binney 1994; van der Marel 1994; van der Marel and Franx 1993; van der Marel et al. 
1994 and references therein). 


11.4 The Projected Virial Theorem 


In the previous section, we showed how in principle it is possible to measure, at each point 
€, in the projection plane, the values of the projected kinematics. In other words, it is 
possible to recover the two-dimensional maps of the relevant projected kinematical fields 
of a stellar system. Today, thanks to sophisticated observational techniques, this field of 
research is undergoing rapid evolution with fundamental improvements to our knowledge 
of the internal dynamics of galaxies (e.g., see Cappellari 2016 and references therein for 
an excellent review). Here, limiting ourselves to a very elemntary discussion, we simply 
describe what kind of information is contained in the projected fields, and we focus in 
particular on the N-th-order projected virial theorem (PVT; e.g., see Ciotti 1994, 2000; 
Kent 1990; Merrifield and Kent 1990 and references therein). 

First, we obtain the moment equation for the quantity of: we specialize Eq. (10.8) to 
the case F(v) = Jn (v) in Eq. (11.8) and we obtain immediately (prove it!) 

dpvN — dapuN y; —— 
— a = ~ = —NnipvN"! a (11.24) 

Then, we generalize the definition of total kinetic energy by introducing the los N-th-order 
total kinetic energy 


1 — 1 —_ 

N 

Kin = x tee dé) = WN [, p vN dx, (11.25) 
a! ie 


where the last identity follows immediately from Eq. (11.6). In practice, K i is the 
N-th-order generalization of the “kinetic energy” that would be measured by considering 
the N-th-order Jos velocity field in the projection plane, integrated over all of the “image” 


and weighted with the projected density. The two simplest cases are obtained for N = 1,2 


KO=n:P,, Ko 


los los nin; Kjj, (11.26) 
where P; is the i-th component of the linear momentum associated with p, so that in the 
barycentric system it would be zero. 

Following a similar approach to that described in Section 10.5, after multiplication of 


Eq. (11.24) for n;x; = (n,x), integration over the configuration space, and recognizing 


that uN Ujnj = up. a we finally obtain (do it!) the N-th-order PVT 


(N) 
Eh = N+ KATY + NWat”, (11.27) 


where 


2 ea .3 a 
WAND = —njn; [. pus —!x; aa d°x=ninjWyy, (11.28) 
Diss i 
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and 
i / (n,x) pv dx. (11.29) 
3 


In particular, from Eq. (10.56) 


! | —— _ Linn; 
hes = 5 | _ prin (xiDj + xjUi)d°x = =S—. 
Nie 


5 (11.30) 


The projection of the second-order virial theorem is then immediately obtained by setting 
N = 1 in Eq. (11.27) 


njnjlij 


7 = 2K + nin; (w+ (11.31) 


los 


We + Wi 
5 ; 
Of course, this identity is not suprising, as it is simply the double contraction of the tensorial 
identity in Eq. (10.55) over the /os direction n! 


11.4.1 The Angle-Averaged PVT 


As is apparent from Eq. (11.31), when applying the PVT to a system with no special 
symmetries, the direction of the observer /os must be specified, but of course we cannot take 
for granted that in any given practical application we know the shape of the stellar system or 
its relative inclination n with respect to the observer’s Jos. For this reason, even if the idea 
may appear initially strange, it is interesting to work out the formalism needed to compute 
angle-averaged analogs of the formulae derived in the previous section and to explore some 
consequences of the angle-averaged Jeans equations and of the PVT. Among other results, 
we will obtain as a bonus some exact relations holding for spherical systems. In fact, in 
the highly idealized (and thus frequently used) situation of spherical symmetry, the PVT 
cannot depend on the observation direction, and so in spherically symmetric systems the 
angle-averaged results simply coincide with the intrinsic identities. 

Let us start by considering a fictitious “observer” that can move all around a stellar 
system observing some projected property Fjos. In general, this property will depend on 
the los direction n. The angular mean of Fos over the solid angle, determined by our 
observer looking at the system from all possible directions, is naturally defined as 


= 1 = 1 ci 20 
[Fiosla = real Fosd°2 = reall i Fos sin } dd'dg. (11.32) 
4n 0 0 


The last expression is determined by our choice of the orientation of S’ and the two angles 
are the spherical coordinates angles used in Eq. (11.2). Therefore, if we consider the angular 
average of the PVT in Eq. (11.27), we can write 


s IQ — 
IES = (NF DI Kio P10 + NEWe hos. Io- (11.33) 
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Our task is now to obtain the expressions for the quantities involved. This can be done most 
easily by first showing that (do it!) at each position x in the system 


[lo = [0 Jo, (11.34) 
—_ 1 aun to 
Jo = ——_—+—— 11.35 
a (11.35) 
1 [yy la 


[njnjup ‘Ie (11.36) 


~ N(N+ 1) dv;00; 
The identities above are established with little effort by first recognizing that averages over 
the velocity space and angular means commute and then by using Eq. (11.8). The first 
identity shows the obvious fact that velocity averages and angular means commute, and 
the second and third identities show that angular means of special combinations of the 


components of n and a can be computed as velocity averages of velocity derivatives of 


N 
P 
for the moment, with the aid of Eqs. (11.34)-(11.36), it is a simple exercise to show that 


the quantities appearing in Eq. (11.33) can be written (do it!) as 


the angular means of the microscopic functions v; (v). We will return to this point later on; 


1 apr tho 
(N)y : Pp 3 
Ling I= Nel [. pxi dx, (11.37) 
- 1 dor [vp t]o 
(N-1) Pp 3 
W. = -———— _———_? ax, 11.38 
Wr ios le = oy dys Ot ax; dv; dv; shoe) 
1 — 
(N+1) N+1 3 
[Kis Jo = N+1 i plvp Ja dx. (11.39) 


The problem is therefore reduced to the calculation of [up" Jq for a generic N, and in turn 
from Eq. (11.8) this requires the evaluation of terms of the form GC) G) [ae ns nla. 
The algebra involved is quite boring but not difficult, and in Exercise 11.7 the curious 


student will find the steps needed to show that for N > K > L>0 


(ECF tet snl _ Tei ie iils ei) 
Rip? oe Le 8(N + 1) 


(ey 
=| ae) 


(11.40) 


x 
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From the general expression in Eq. (11.40), it follows that for any odd natural number 
N=2n+1 


[yy ae =. (11.41) 


This is because in Eq. (11.40) only terms with even K and L can be nonzero, but then 
N — K is odd, and the result is proved. For N = 2n instead 


ih? vg PO Is ae ae 
[v"lo = UL (Hie “uy Vy, (11.42) 
k=0 1=0 


as can be proved (do it!) by setting N = 2n, K = 2k, and L = 21 in Eq. (11.40). 
In particular, forn = 1, [uslo — Cn + v5 + v2) /3; and so 
Dy + Oe) +H | Of + O99 + 955 


j 11.43 
3 3 ( ) 


[W]e = 
and from Eq. (11.39) 


K 


[Kilo = . (11.44) 


(i.e., the angular mean of the total Jos kinetic energy equals a third of the total kinetic 
energy). The same result can be stated by saying that the angular mean of the second-order 
projected velocity field of a stellar system, weighted over the projected density distribu- 
tion &, is equal to a third of the squared virial velocity dispersion ae of the considered 
component. The angular mean of the second-order PVT reads 


+, (11.45) 


which is of course in perfect agreement with the expression obtained by computing the 
angular mean of Eq. (11.31). Clearly, for a spherical stellar system, Eq. (11.44) also holds 
with [Kilo = Ki? 


los los * 


Exercises 


11.1 Construct the rotation matrix 7 in Eq. (11.1) so that the observer’s Jos is aligned 
along the &3-axis of the rotated system and that in So the direction n of the observer 
is parametrized by the usual spherical coordinates. Show that the three successive 
rotations are described by the matrices 


cosy —sing 0 
R3(g) =| sing cosg O], (11.46) 
0 0 1 


cosv OO sin? 
Roa(v) = 0 i 0 ; (11.47) 
—sinv’ O cost 


11.3 


11.4 


11.5 


11.6 
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cosw —siny 0O 
R3(w)=|sinw cosy OO}, (11.48) 
0 0 1 


and prove Eq. (11.2). Note that R2(7) and R3(y) also appear in Eq. (3.20). 

With this exercise, we estimate the “transparency” of a stellar system (i.e., the prob- 
ability that two stars are aligned along the Jos, thus in principle violating the mass 
(number or luminosity) conservation in Eq. (11.7)). For the galaxy toy model in 
Exercise 1.1, define the average covering factor as 


2 


: NaR5 
Covering factor = aR? (11.49) 


and estimate its value for an elliptical galaxy and a globular cluster. What are the 
conclusions you can draw from this? Notice that the stellar density in real stellar 
systems depends strongly on the distance from the center. 

Prove the identities in Eqs. (11.13) and (11.14). Hint: Rewrite Eq. (11.8) as up = 
(Up — Up + DH) = (Up — Dojog + Tpjo,)™ and Eq. (11.10) as a4 = (vp — Tajo, + 
pee =-\N 

Upios ~ Up): 

Consider a perfectly monocromatic LP with Po = 6(A — Ag), so that P, = 6[A/( — 
Up/c) — Ao] — Up/ c)~|. By using the standard rule for the change of variable in the 
Dirac 6-function in Eq. (A.98), show that 


LP(A) = a [e.. (1 = >) c|. (11.50) 


and verify the normalization property i LP(A) da = 1. 

Prove the identities for A, and o, in Eqs. (11.22) and (11.23). What happens in 
the special case of a monochromatic LP with Pp = d(A — Ao)? Hints: Substitute 
Eq. (11.21) into the integral expressions of 2, and o,. For A,, invert the order of 
integration, integrate over A, and then use Eq. (11.18). For o,, consider (A — hy)? = 
2 = 2ddAy + er from Eq. (11.22) and the normalization of the LP, the integration of 
the last two terms gives —1. To integrate the A* term, invert the order of integration, 
change the variable in the inner integral to A/(1 — up/c) = ¢, and set r= (t— 
ho + Ao), so that from the definition of Pp and its normalization, the inner integral 
evaluates to Oe + do- For the outer integral, write (1 — w/e)" = [1—¥p,,./¢ — (Up — 
Unies expand the integrand, and from Eq. (11.18) and Eq. (11.19) with N = 2 
and the normalization of the VP, finally conclude. 

Estimate the expected value of og for a spectral line produced by the photosphere of a 
star of spectral type (say) G or K and compare the result with the Doppler broadening 
0, due to the motions of the stars in a stellar system, such as an elliptical galaxy. What 
are the practical consequences for Eq. (11.23)? 

Prove Eq. (11.40). Hints: Let a and b be non-negative integers and let Ig(a,b) = 
iP sin” x cos? x dx. From Eg. (11.8) and the definition of the angular mean, first 
show that 
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I,(K +1,N — K)hbyz(K -—L,L 
[nY-K KEL yo - w(K + : Jor ( ) (11.51) 
~ IT 


Then, by successive reduction to the first quadrant, prove that 
Ion (a,b) = [1 + (-1)*? Un (a,b), Ie (a,b) = [1 + (- 1)? Ua(a,b), (11.52) 
and from Eq. (A.53) 


1 a+1b+41 
1npola.b) = 58 ( 7D is 1/2(b, a), (11.53) 


where B(x,y) is the complete Euler beta function. Identity (11.40) is finally 
established by multiplication for the binomial coefficients appearing there using 
Eqs. (A.49) and (A.54) and by performing the needed simplifications. 
Following Exercise 10.1, we now consider the relations between first, and second- 
order projected velocity moments, of a multi-component stellar system described by 
a discrete set of distribution functions f;,. Show that 
oe = 2 
Us = Lek EE Ios)k a7 los = Dur Uk Vems)k = Tse (11.54) 
x x 

where © = )°, Xx is the total surface density of the system and from Eq. (11.16) 
(V2 Jk = (v2, Jk = (62, Jk + (Je Hints: Consider the N = 2 case of Eq. 
(11.12), expand the square and take separately the phase-space average over each fr, 
then project along the Jos and use the first identity in Eq. (11.54). 
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Phase-Space Approach 


Armed with the power of the Jeans theorem, we now proceed to formulate and discuss 
the so-called direct problem of collisionless stationary stellar dynamics. This approach 
is best suited for systems where empirical/dynamical arguments can lead to a plausible 
ansatz for the form of the underlying distribution function (DF), expressed in terms of the 
relevant integrals of motion. In the absence of such an ansatz and in the presence of specific 
requirements (in general motivated by observations) for the density and velocity dispersion 
profiles, a different and complementary approach based on the use of the Jeans equations 
is often followed, which is the subject of Chapter 13. 


12.1 The Construction of a Galaxy Model: From / to p 


In the most general case, the direct problem of collisionless stationary stellar dynamics for a 
system made of n different density components consists of the assignment of the functional 
form of n DFs f;, dependent on a set of isolating integrals of motion of the (unknown) total 
potential and some free parameters. The choice of the integrals of motion can be based on 
physical arguments and/or on the phenomenological properties expected for the resulting 
system. The task is to determine explicitly (usually numerically, but in some very special 
cases analytically) the functions f; through the determination of the total potential @r(x). 
In addition to the potential #(x) produced by the n components, the total potential can also 
contain some imposed potential (a central black hole, an external tidal field, a dark matter 
halo, etc.) that we will call “external” and indicate with @ext(x). Once the total potential 
is determined by solving the associated Poisson equation, the properties of the system can 
finally be studied and compared with observations. Some care should be taken regarding the 
concept of a component of a stellar system: by “component” we mean each part (e.g., each 
stellar and/or dark matter component) of the system that is characterized by specific proper- 
ties (chemical composition, age, mass-to-light ratio, internal kinematics, etc.). In principle, 
the “number” of physically distinct components can be “infinite” when the extended DF 
depends on some continuous parameter (e.g., as in the presence of a continuous distribution 
of metallicity/age/initial mass functions (IMFs) or other properties of stellar populations). 
In this latter case, the total DF is obtained by integration over such parameters, and a simple 
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example is given in Chapter 8 (see also Exercise 10.1); in the rest of the present chapter, 
for simplicity we restrict ourselves to the discrete case of a finite number of components. 

A basic requirement to be satisfied by the DF associated with each component is that 
it has to be non-negative over the phase space. When this requirement is satisfied each 
component is said to be consistent, and a self-consistent stellar system is a self-gravitating 
system for which each component is consistent. For technical reasons, at least for systems 
of finite total mass (or when the total potential @r can be set equal to zero at infinity; see 
Chapter 2), it can be convenient to work with relative energies and potentials. In particular, 
the relative potential Vy and relative energy per unit mass € are defined as 


IIvil? 
Wy = V+ Vext = — — dext, =—-E=WV_7—- = (12.1) 


Thus, in general, the relative potential defines a family of isopotential surfaces enclosing 
more and more volume of space as the value of Wy decreases. 
The previous and qualitative considerations can be formalized as follows: let fi (x, v; ft) 


be regular functions for k = 1, ...,”. From Eq. (9.9), the function 
nN 
f= so (12.2) 
k=1 


is said to be the DF of a collisionless, (self)-consistent multicomponent system if and only 
if fork =1,...,n, 


Ok 4 (v, Vy fe) + (Yur Vy fi) = 0 (12.3) 
fk = 9, (12.4) 
AW, = —47 Gpx, (12.5) 
where from Eq. (9.10) 
= ; = : sy, Ws 3s UW, AW = —47Go. (12.6) 
p » Pk 2 - Skdrv 2. 


Each function f, is said to describe a consistent component. 

The student will recognize that at the core of the problem, leaving aside all technical 
details, these are basically all of the important concepts encountered so far. In practice, 
we ask for a system where each of the components must obey the collisionless Boltzmann 
equation in the total potential through the global integrals of motion in the total potential, 
and the resulting density distribution therefore must obey the Poisson equation. Physically, 
we are asking something quite formidable: we are attempting to determine orbital distri- 
butions such that the resulting total density o produces a potential @ that, possibly added 
to an imposed external potential dext, admits the adopted global integrals of motion! In 
practice, the problem is a sort of “bootstrap” process. As we will see, the “closure” of 
the problem (physically, the consistence, or self-consistence when dext = 0) is provided 


12.1 The Construction of a Galaxy Model: From f to p 213 


by the Poisson equation. Therefore, the student will appreciate that just the possibility of 
formulating this problem is by itself quite a beautiful piece of physics. 

We are now in a position to formulate the direct problem of collisionless stellar dynamics 
for multicomponent stationary systems. For i = 1,...,m, let [;(x,v) be regular functions 
(i.e., at least C“) functionally independent over the one-particle phase space y, with [; =€ 
given! in Eq. (12.1). Moreover, let 


f= fAGeG), B= 1.5% (12.7) 


which is the DF of the k-th component. For each k, A, is a vector parameter “controlling” 
some property of f; (the examples in the following will elucidate the meaning and role 
of the quite mysterious object A;). In practice, f;, depends on a choice of the integrals of 
motion J; of the potential Yr of the total system and on a set of parameters adopted to 
hopefully control the properties of interest of f,. 

For a given k and Xx, the set of points in phase space 


Q, = {Kv Ev: fe =O} (12.8) 


is associated with f;,. In practice, for each f;, the set Qx is the region of phase space where 
J is non-negative for the given choice of A;. Note that from Eq. (12.7) the boundary 0&2, 
necessarily depends on x and v through the integrals of motion J; only. Moreover, for each k 
and A, and for each x € Q,, the set 


Qk = {V2 (XV) € Qz} (12.9) 


is called the velocity section of Q, at x. In practice, Qzx is the set of admissible values of v 
at the position x so that /; is positive for the given choice of Ax. 
The mass density of each component at position x is naturally obtained as 


prix, Ur(x)] = [ fed?v. (12.10) 
kX 


As we will see in the following discussion, not all of the (a priori plausible) choices of 
A, can be physically acceptable. This is formalized by the definition of the three parameter 
sets 


A= Jia Qe AD), (12.11) 
k=1 


n 
Av= lee: : AWy = —47Gpr has a regular/physically acceptable solution}, 


k=1 
(12.12) 
n 
A; = [Jia : [; are global regular integrals of motion for Yr}. (12.13) 
k=1 


! Note that due to the time independence of Wy, the (isolating) integral J; = € is always available. 
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Therefore, the direct problem is solvable if and only if Az = A+N AwN A; 4 GY. Note 
that for Ax € A+ the density px in Eq. (12.10) is non-negative by construction, but its 
dependence on the coordinates x is known only after the problem is solved, and Wy7(x) 
is determined by the solution of the Poisson equation in Eq. (12.6) where the total p is 
obtained by summing the functions in Eq. (12.10). In summary, if the direct problem is 
solvable, from the Jeans theorem the functions f, are (stationary) DFs for each admissible 
Xx, and the stellar system described by f = )-y_, fk is a collisionless, stationary, self- 
consistent, multicomponent system. In applications, the starting point of the procedure is 
the assumption of the desired geometry for the system (i.e., for 7), so that the functions J; 
are usually determined by symmetry considerations. The problem then reduces to the deter- 
mination of the set A, to the construction of the n density functions in Eq. (12.10) as a 
function of x and Wy, to the solution (numerical or analytical) of the nonlinear Poisson 
equation for the total potential in Eq. (12.6) with suitably assigned boundary conditions, 
and to a final check of A3. Only at this stage, after the determination of Yy7(x), is the density 
distribution of each component finally known from Eq. (12.10). 

Before proceeding with the discussion of the most elementary cases, it is important 
to remind ourselves that the “orbital description” sketched out above is at the basis of a 
very powerful numerical method that is used to (numerically) build self-consistent systems, 
namely the Schwarzschild orbital superposition method (e.g., see Richstone 1980, 1984; 
Schwarzschild 1979). Other quite sophisticated methods, directly based on the Hamiltonian 
structure of phase space and in particular on the numerical evaluation of the actions of given 
potentials (e.g., see Binney and Kumar 1993; Binney and McMillan 2016; Kaasalainen and 
Binney 1994; McGill and Binney 1990; Sanders and Binney 2015 and references therein; 
see also Binney and Spergel 1982, 1984), are today being actively studied. All of these 
methods are well beyond the level of the present discussion, and the interested reader is 
invited to consult the more advanced treatises of Bertin (2014) and Binney and Tremaine 
(2008) and references therein. 

We now discuss some general properties of systems that are generated by elementary 
choices regarding the integrals of motion and functional forms for f; in order to introduce 
the student to some illustrative cases that should clarify the previous quite technical but 
necessary preliminary discussion. 


12.2 Spherical Systems 


It is not a suprise that the case of spherically symmetric systems is the simplest from a 
mathematical point of view, and in this family we first consider the even simpler case 
of one-component, self-consistent spherical systems. Yet, from a physical point of view, 
the situation is already interesting enough to allow for a clarification of several of the 
aspects mentioned in Section 12.1, and for us to construct systems of great relevance 
for observational and theoretical works. The student should not be surprised then that the 
majority of models adopted in the research literature are in fact spherically symmetric. 
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12.2.1 Isotropic Spherical Systems 


As we will now see, restricting ourselves to systems with a DF depending only on the 
orbital energy of stars leads us to build stellar systems with a velocity dispersion tensor 
that is necessarily isotropic over all of the space; notice that for simplicity of notation we 
avoid introducing the subscript k and we stress that for the moment we are not assuming 
spherical symmetry. 

We begin by assuming /; = € as defined in Eq. (12.1), and 


f =h(E)@(E) = 0, (12.14) 


where h(E) is some regular function and 6(€) is the Heaviside step function in Eq. (A.99). 
The “truncation” at negative relative energies is a natural requirement for a system of 
finite total mass (or more generally with a potential that can be fixed to zero at infinity; 
see Chapter 2), but in Exercise 12.1 we will briefly address more general cases of energy 
truncation, or of no energy truncation. 

We now prove that from Eq. (12.14) it follows that 


y 
p(Wr) = 4x / : V2(Wr — ENH(ENdE, (12.15) 
0 


yj=0, voj=0, iA j =1,2,3, (12.16) 
and so OF; = 0 fori # j. Moreover (no sum over i intended) 
2 An 3/2 : 
p(Vr)o7 (rT) = 3h [20Wr — E)P'“A(ENdE, i = 1,2,3. (12.17) 


In fact, from Eq. (12.8), Qe = {(x,v) € y : Wr — ||vil?/2 = O}, and according to 
Eq. (12.9) the velocity sections Qxx are spheres of “radius” ./2W7(x). The natural coordi- 
nates in velocity space to be used to integrate over Q;x are the spherical ones, i.e., 


Vy =vsindA cos, Vy =VsindA sin, vz =vCosA, (12.18) 
with 0 < v = |lv|| < /2W7, 0 < A < a, and 0 < p < 2m (see Figure 12.1), so that” 
d°v =v’ sind dv da du. (12.19) 


From Eq. (12.10), one then obtains 


JIU 
p= fav =4n / hv-dv, (12.20) 
0 


2 The angles A and yz coincide with the two angles # and ¢ of spherical coordinates in the configuration space; however, this 
will not be true anymore in the next cases. 
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LX 


Figure 12.1 The decomposition of the velocity vector v adopted to integrate the DF in the isotropic 
case (Section 12.2.1). jz is the angle between e, and the projection of v on the (ex, ey) plane and A is 
the angle between e; and v. 


because f depends only on v. Finally, by changing variable from v to € at fixed x (do it!) 


a ae (12.21) 


Vr — E) 


and so Eq. (12.15) is recovered because? 0 < € < Wr. With a similar procedure, it is trivial 
to show that Eq. (12.16) holds due to the vanishing of angular integrals. Finally 


J2U7 cL 20 
po; = i hv-dv / sind dd i urdu, (12.22) 
0 0 0 


and after integration over the angular variables, Eq. (12.17) is recovered. Therefore, we can 
conclude that under the hypothesis in Eq. (12.14), the density p depends on x only through 


3 Of course, for a generic value €; of the truncation energy, the velocity sections Qxx are spheres of radius \/2[Wp(x) — Et] 


and the energy integration interval is E, < E < Wp(x). Absence of truncation means E; = —oo and Qy, = #3; see also 
Exercise 12.1. 
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Wr (Le., e = e(Wr)); streaming motions cannot be present and the velocity dispersion 
tensor is globally isotropic (and so v? = o; fori = 1,2,3). 

As has already been stressed, up to now we have not yet imposed spherical symmetry 
(i.e., the shape of Vr is not specified), and for this reason we maintained in all generality 
a Cartesian decomposition of the velocity vector in Eq. (12.18). In turn, as all stationary 
stellar systems (independently of their shape) admit the orbital energy as a global isolating 
integral of motion, one could feel free to imagine the existence of (isotropic) stationary 
systems without special symmetries characterized by DFs that depend only on the energy 
(perhaps simply because other isolating integrals do not exist!). However, an important 
(and perhaps unexpected) consequence of global isotropy can be derived by imposing self- 
consistency through the Poisson equation 


AW = —47 Gp(Wy). (12,23) 


In fact, a quite remarkable mathematical theorem (Gidas et al. 1979; see also An et al. 
2017; Ciotti 2000) states that the only positive and nontruncated solutions for Eq. (12.23) 
(with YW = Wr) are spherically symmetric, and so we conclude that a nonspherically sym- 
metric, spatially untruncated, one-component stellar system with no other regular integral 
of motion in addition to E cannot be stationary (and so even € is not an integral of 
motion). In fact, we assume the system to be stationary, so that € is an isolating integral 
of motion, and nonspherical, with no other regular integrals of motion. From the Jeans 
theorem, necessarily f = f(€), but from the previous result, the Poisson equation does 
not admit acceptable solutions. This contradiction implies that one of the assumptions is 
logically inconsistent (i.e., the system cannot be stationary). As a consequence, in order 
to construct nonspherical, self-consistent stellar systems, at least two regular integrals of 
motion are needed. 


12.2.2 Anisotropic Spherical Systems 


We now proceed to explore the general consequences of assuming a two-integral DF for a 
spherical stellar system. The most natural assumption (but not the only one!) is to adopt as 
global integrals the orbital energy and the modulus of the orbital angular momentum. 

Therefore, we assume that the total potential Yy(r) is spherically symmetric, with r = 
\|x|| and 7; = €, fb, = ||J\|? = J? (where J = x A vis the angular momentum per unit mass 
of each star). Moreover, let 


f =h(E,J*)0(E) = 0, (12.24) 


where again we allow for a possible energy truncation. We will prove that 


4n pT r/2UT-E) AE, J?) TdT 
_ as [ n(E, J*) 
0 0 


: 12.25 
° J2br — €) — Pe]? es 


p 
: 


Vp = Vp = Vo = UpVZ = VpVg = Vp Ug = 0. (12.26) 
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Moreover 


, 4n Wr r/2(W7—E) J 
po? = = | as [ h(E, J?), Wr — €) — = JJ, 1297) 
r 0 0 r 


2 4a AY ET dd 
r* Jo 0 Vy — €) — J2/r? 


(12.28) 


In fact, from Eq. (12.8), Q, = {(x,v) Ee y : Vr IIv||7/2 > 0}, and according to Eq. 
(12.9), the velocity sections Qxx are spheres of radius ./2W7(r). In this case, the natu- 
ral coordinates in configuration space are the spherical ones (r,%,@), and the associated 
velocity components are (v;, U9, Ug). From the geometry of Qxx, the natural coordinates to 
perform integrations over the velocity space are the spherical ones, with 


Vp = VSINA COS, Vg =VUSINA sinpL, Vv, = UCOSA, (12.29) 


with 0 < v = |lv|| < /2W7(r), 0 < A < wz, and 0 < mw < 27. Note that J = rv,;, with 


vu = ti + Je is the tangential velocity, and v= v2 + eg thus, vy and vg appear in 
the DF only through vy, = v sina and f is idependent of jz. From Figure 12.2, the student 
will notice that the angles 4 and y are defined with respect to the local spherical coordinate 
system, so they are not the same as in Eq. (12.18); of course, we could have also used 
the present velocity decomposition in the isotropic case in Section 12.2.1 once spherical 
symmetry is assumed. 

In order to proceed with integration, we first use Exercise 12.2 and then we change the 
coordinates from Eq. (12.29), finally reobtaining Eq. (12.19). After integration over jz and 
reducing the interval of integration of A from (0,7) to (0,2 /2), the velocity moments of 
interest are immediately written as 


JIT x/2 
p= ax [ vav f h sind dh, (12.30) 
0 0 
JIT x/2 
po, = ax [ via [ h (cosa)? sina da, (12.31) 
0 0 
JIT x/2 
poe = 4x i vtdv / h (sina)? da. (12.32) 
0 0 
With a last change of variables from (v, A) to (E, J) (do it!), one finally obtains 
dEdJ 
dvd = (12.33) 


r JUG = Ey Ubr — E)— Pfr? 


with 0 < € < W7,0 < J < r./2(Wy7 — €), and this completes the proof of Eqs. (12.25)— 
(12.28). The student is invited to evaluate the identities in this section for independent of 
J? and to recover the corresponding indentities in Section 12.2.1. 
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6b 


Figure 12.2 The decomposition of the velocity vector v adopted to integrate the DF in the two- 
integral case for spherical systems (Section 12.2.2). w and A are the usual spherical angles in the local 
reference system (f,,f9, fp). Note that we could have also used this decomposition in the isotropic 
case instead of Cartesian coordinates. 


Therefore, at variance with the isotropic case, now the density p depends explicitly on 
bothr and Wy (i.e., o(r, Yr) in Eq. (12.25)); streaming motions cannot be present and Oo; = 
os = a /2, where o; is the tangential velocity dispersion. The velocity dispersion tensor 
is therefore anisotropic, and the associated velocity dispersion ellipsoids are axisymmetric 


around the radial direction. In order to characterize this type of anisotropy, the function 


_ oe (r) 
20? (r) 


BO)= (12.34) 
is introduced (e.g., see Binney and Tremaine 2008). When 6(r) = 0, the system is isotropic 
(at r); when B(r) > O, the system is radially anisotropic (i.e., the velocity dispersion 
ellipsoid at r is prolate); and when f(r) < 0, the system is tangentially anisotropic (i.e., 
the velocity dispersion ellipsoid at r is oblate). In Exercise 12.3, two important and general 
identities about anisotropic spherical systems will be proved. 


220 Modeling Techniques 1: Phase-Space Approach 


12.2.2.1 Fully Radial and Fully Tangential Systems 
Two very special families of stellar systems are now considered: the fully radial and 
the fully tangential spherical systems. The fully radial case is obtained from Eq. (12.24) 
for f(E,J*) = h(E)5(J*)0(E). With elementary transformations, Eqs. (12.25)—(12.27) 
become (do it!) 


Ww 
7 zt T h(E)dE apes) 
0 


7 Vi — €)" 


: 
> 20 Vr 
po, = 2 , h(E)/ 27 — ENdE, (12.36) 


while of course o¢ = 0. The fully tangential case (i.e., a density distribution realized by 
circular orbits only) is obtained with a DF of the form 


pr) 
Fup, U7) = d(u, )d[v; — ve(r)], (12.37) 
27 Ue (r) 
where v2(r) = —r dWy/dr is the circular velocity at radius r. The identities 7 = 0 and 
oa? = v2 are easily checked. Note that in this case the DF is not written explicitly in terms 


of the integrals of motions € and J, but this can be done by recalling that for circular orbits 
of energy € the function r = r(€) is determined from 


r dWVr(r) 
E=W =i ; 12.38 
Ls erm ( ) 
while the angular momentum per unit mass of circular orbits is 
dW 
Peep (12.39) 
dr r=r(E) 


(see Chapter 5). 


12.2.2.2_ Osipkov—Merritt Systems 
A very important and widely studied family of spherical, anisotropic stellar systems is 
given by the so-called Osipkov—Merritt (OM) systems (named after their two independent 
discoverers, Merritt 1985a; Osipkov 1979). In this family, one defines the quantity 
JZ 
where r, is a free parameter called the anisotropy radius (for reasons that will become clear 


in the following), a first example of the parameter(s) A, introduced in Section 12.1, and we 
assume 


O=e= (12.40) 


f =h(Q)0(Q) = 0. (12.41) 


We now prove that 


Wr 
p= ara | V2(Wr — Q)hA(Q)dQ, (12.42) 
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w 
po, = 47 B(r) [ "207 — QyP?2n(Qyag, (12.43) 
Wr 
po, = ancy) | [207 — O)]7/7n(O)dO, (12.44) 
where the three radial factors are 
Ary= Bry) = 2) ee 2A") 12.45 
ak LTT N= (r) = 3 : (12.45) 


The natural coordinates in configuration space are again the spherical ones. In fact, 
Qe = {(%V) € ys Wr — |Ivll?/2 — J7/2r?2 = Wy — v2 /2 — v2 (1 + r?/r?)/2 > 0}, and 
the velocity sections Q; x are rotation ellipsoids around f, (see again Figure 12.2). As a 
consequence of the geometry of Q; x, the natural coordinates for integration in velocity 
space are the ellipsoidal ones, 1.e., 


vsinA cos vsindA sin 
= —————.,,_ ly = vcosiA, (12.46) 


v3 = SS , Ug = ’ 
JVl+r?/r2 . Vl+r?/r2 


with 0 < v < /2Wr7(r),0 <A <a, and 0 < pw < 27, so that from Exercise 12.2 we now 
have (prove it!) 


3 v? sind 


= — dd du. 12.47 
acre Me ( ) 


Note that v, = v sind/,/1 + r2/r2. With the transformation in Eq. (12.46) f becomes inde- 
pendent of jz and 4, with 


Jy m/2 
p= aracr) | v nav [ sind da, (12.48) 
0 0 
JIT x/2 
po? = 4r A(r) / vt hdv / sin A(cos A)? da, (12.49) 
0 0 
JIT x/2 
poe = 4n A7(r) / vt hdv (sina)? da. (12.50) 
0 0 


Performing the elementary integration over A and changing the variables from v to Q, one 
finally obtains (do it!) 


fre (12.51) 


J2(r = Q) 
and so Eqs. (12.42)-(12.44) are recovered, because 0 < Q < Wry. OM stellar systems 
are obviously a subset of those described in Section 12.2.2; they do not present stream- 
ing velocities, while their velocity dispersion tensor is anisotropic with two different 
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components, a, and o. The relation between radial and tangential velocity dispersions in 
OM systems is very simple, and from Eq. (12.34) we find 


r2 


Bir) = aa (12.52) 
As a consequence, in OM systems, the velocity dispersion tensor is isotropic at the center 
independently of the particular value of rg. When rg — oo, B(r) = OVr => O (ie., the 
system is globally isotropic) due to the fact that now Q = €. For fixed rg the velocity 
dispersion tensor becomes more and more radially anisotropic for r — oo, while inside rg 
the velocity dispersion tensor becomes more and more isotropic. This is the reason why the 
parameter r, is called the anisotropy radius. OM models are widely used in applications 


due to their flexibility and simplicity of use (see Chapters 13-14). 


12.2.2.3 Cuddeford Systems 


An interesting generalization of OM models is given by the Cuddeford anisotropic models 
(Cuddeford 1991). Let Q be defined again by Eq. (12.40) and assume 


f = "n(Q)0(Q)=0, a>, (12.53) 


so that OM models are recovered for a = 0. Then, 


wv 
p= 4racr.a) f "Dur — Q)*'7n(Q)dQ, (12.54) 
0 
vr 
po, = An B(r,a) / [2(r — Q)IP7h(Q)dQ, (12.55) 
0 
Wr 
po? = axc(r.a) [1200 — Qn"? Qd9, (12.56) 
0 


where now 


Jn Ta +1) 2a 4a 


_ HI 
A(r,a) = OTe 43/2) +3/D re“A (r), (12.57) 
as Vr T@+) 2a gatl 
Bir,a) = 4P@ + 5/2). A (r), (12.58) 
Ciro) = VETO +2) 20 4042p), (12.59) 


(a+ 5/2) 


the function A(r) is given in Eq. (12.45), and I'(x) is the complete Euler gamma function 
in Eq. (A.42). The proof of the identities above is similar to that of OM models: with the 
same change of coordinates, one obtains 


=4 2a aor oe 2a+2 ne : 2a+1 
p=4nr (r) v hdv (sin A) di, (12.60) 
0 0 
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where the convergence of the angular integral requires ~ > —1. Moreover 


J 207 x/2 
poe = Anr* A%*1(r) / vtthdy i (sin d)?"+! (cosa)? da, (12.61) 
0 0 


Jy m/2 
poe = 4nr™ A**?(r) ii v4 dy / (sina)*+3 dd. (12.62) 
0 0 


After integration over 4 by using Eqs. (A.53)-(A.54), the change of variable in Eq. (12.51) 
completes the proof. 
The velocity dispersion anisotropy profile of Cuddeford models is 
Ve voor (12.63) 
B(r) = Page : 

Therefore, for —1 < a < 0, the function f is positive and the velocity dispersion is radially 
anisotropic at all radii independently of the particular (finite) value of r,, while as expected 
for a = 0 one recovers the OM anisotropy. Finally, for a~ > 0, the anisotropy is tangential in 
the inner regions with r < ./ar, and radial for r > ./ar,; in the limit « — oo, the orbital 
structure is fully tangentially anisotropic (i.e., 8 —-> —oo). Note also that for rg — 00, 
B(r) = —a Yr > 0 (i.e., the systems are characterized by a constant anisotropy profile). In 
practice, Cuddeford models can be seen as simple generalizations of both OM models (for 
a = () and constant-anisotropy models (for r, — oo), having the advantage of describing 
some amount of tangential anisotropy without the problems of OM tangential models (see 
Exercise 12.5). 


12.2.2.4 Generalized Cuddeford Systems 


We finally consider a very general family of models containing as special cases the mod- 
els encountered in the previous sections. These models have been introduced in Ciotti 
and Morganti (2010a), and they will be used to prove some quite interesting results in 
Chapter 14. We refer to this family as the multicomponent, generalized Cuddeford systems, 
in which the DF associated with each density component is the sum of an arbitrary number 
of Cuddeford DFs in Eq. (12.53) with arbitrary positive (constant) weights w; and possibly 
different anisotropy radii rj; in other words, for each component 


PZ 


—, a@>-l. 12.64 
272 ( ) 


f=P* > wih(Qi)0(Qi), Qi =E- 
i 
Note that for a given density component, the function A and the angular momen- 
tum exponent a are fixed, but they can be different for different components; the list 
(@,W1,.--,/al,---) 18 another example of the vector parameter Ax. The student should 
prove that Eqs. (12.54)-(12.56) still hold, where now 
afm Vio 1)r™ 


Wi 
MO) = “sr @ 43/2) rrr ee 


i 
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afr Via + yr Wi 
B(r,o) = 12. 
i aera — (Lp r2/rayer eee) 
2a ‘ 
C(r,a) = velo Ss bal ; (12.67) 


2r(a + 5/2) (Lt r2/r2yot2 


i 
Of course, the orbital anisotropy profile associated with Eq. (12.64) is in general not a 
Cuddeford one, being 


C(r,a) 2 
2B(r,a) 


pO es eat ie all 
Do; wif + 2 /rZye+ 
Quite general anisotropy profiles can be obtained through specific choices of the weights 
w;, the anisotropy radii raj, and the exponent w. However, near the center, B(r) ~ —a and 
B(r) ~ 1 for r — oo independently of the specific values of w; and rg; and the number of 
subcomponents 7. We conclude this discussion by stressing that the given examples should 
not be considered to be even a partial list of the models that can be found in the literature; 
for example, the student is invited to consult Cuddeford and Louis (1995), Gerhard (1991), 
and Louis (1993). 


Bir) =1 (a + 1) (12.68) 


— 


12.2.2.5 Stellar Polytropes, King, Wilson, Michie, foo, and f™ Models 


Having introduced some of the most used families of DFs for spherical stellar systems, 
we now move on to one of the most important cases of the direct problems of stellar 
dynamics: the construction of the family of so-called spherically symmetric self- gravitating 
stellar polytropes. The applications of this are numerous and far-reaching because from a 
mathematical point of view the same problem is encountered in the construction of self- 
gravitating, polytropic gaseous spheres (see Chapter 10; see also Binney and Tremaine 
2008). The starting point is to consider the isotropic DF 


f =An(E - &)" 7 O(E — E), (12.69) 


where n is a free parameter and € is a truncation energy. From Eq. (12.15) with Yr(r) = 
W(r) (the self-gravitating condition) and intoducing = W — €;, one immediately obtains 
(prove it!) 

22)3/*T(n — 1/2 1 

p= BrP"0(®), By = An’ 2 a / 2 n>-, 

T(n + 1) 2 

so that the density vanishes at the radius 7; (if it exists!) where ®(7,) = O (i.e., where 

W(r,) = €,). Moreover, it is a simple exercise to show that from Eq. (12.17) each of the 
three identical diagonal terms of the velocity dispersion tensor is 

co 
of = —— 


a ntl 


(12.70) 


6(®) =o", (12.71) 


so that the velocity dispersion also vanishes at the system edge and it is linearly proportional 
to YW — &,. Notice that no sum is intended over the repeated indices. 
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The Poisson equation in spherical coordinates is 


ld ,d® 
r2 dr (: dr ) = —41 GB, ®"6(®), (12.72) 


with the regular boundary conditions ®(0) = ®p and d®/dr = 0 at r = 0, and the Heav- 
iside function takes care of the fact that the system density is zero outside the truncation 
radius. Two physical scales can be associated with this problem, and Eq. (12.72) can be 
cast in dimensionless form by introducing 


r ® 1 


s=-, = , Oo= : 12.73 
ro yo > Ja GB, Bp — 
The resulting dimensionless equation is the well-known Lane—Emden equation 
ld (4d A 
= O(~), 12.74 
an (: 7, ) y"0(y) (12.74) 


the solutions of which have also been investigated extensively in the context of stellar 
evolution (e.g., see Chandrasekhar 1939). It can be proved that: (1) forn < 5, the total mass 
associated with the density distribution is finite and the density vanishes at a finite trunca- 
tion radius r;; (2) for n = 5, the density is spatially untruncated, but the mass is still finite; 
and (3) for > 5, the density is nontruncated and the mass is infinite. Moreover, analytical 
solutions exist for two values of n > 0, namely n = | (the linear Helmholtz equation) 


sin(s) 


, ifs <7, 


o= (12.75) 


and n = 5 (the Schuster solution) 


(12.76) 


i a 
JV1+s2/3 
In both cases, the total mass can be found explicitly from the Newton theorem, namely 


2 
dd d 
ii a ie (12.77) 
ron G dr G sos ds 


(i.e., My = m®oro/G and Ms = V/3®or0/G). In the astrophysical literature, the n = 5 
case is also known as the Plummer model (Plummer 1911; see also Chapter 13). 

The term stellar polytropes for the objects in Eq. (12.70) derives from their formal 
similarity to gaseous polytropes (e.g., see Chandrasekhar 1939; see also Section 10.4.1). In 
fact, the equation of hydrostatic equilibrium (10.30) for a self-gravitating gaseous sphere 
with polytropic stratification p « p” (where y is the polytropic index) reads dp/dr = 
—pdd/dr, and its solutions are given by Eq. (10.31). In particular, notice how Eqs. (12.70)— 
(12.71) are formally identical to Eq. (10.31) provided 


M= 


1 kgT 
yslt-, =, B=, Ga (ltn)of, 12.78) 
n ()Mp oj" +n)" 
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where Wo, 00, and po are the values of the potential, the velocity dispersion, and the density 
at some distance ro for the center, respectively; in particular, notice that from the second 
of the identities in Eq. (12.78) we have the identification oe = fo from the last identity of 
Eq. (10.31). Therefore, we have obtained the important result that the density and velocity 
dispersion radial profiles of a stellar polytrope of index n are the same as the density 
and temperature radial profiles of the gaseous polytrope with the same potential and with 
parameters as in Eq. (12.78). 

A special and extremely important limiting case of polytropic systems is the so-called 
isothermal sphere, obtained from the DF 


__ Al a 
if => Oro’ Oj, =O , (12.79) 


where / is a density scale, no energy truncation is imposed, and for which the velocity 
dispersion is independent of the radius (see Exercise 12.6). In fact, if we consider the formal 
analogy (for suitable choices of the parameters) of stellar and gaseous polytropes and that 
the limit case of a isothermal gaseous equilibrium in Eq. (10.32) is obtained from the limit 
y — 1 of gaseous polytropes, we can reasonably expect that Eq. (12.79) can also be 
obtained as the limit of Eq. (12.69) for n — oo, after B, and € are given by Eq. (12.78). 
We prove this result in Exercise 12.7. 

Other isotropic DFs are obtained by truncating the isothermal sphere for € < & (where 
€ 1s a truncation energy mimicking the effect of escapers above the energy —&; see also 
Section 6.2.1). Important cases, often adopted to model globular clusters (e.g., see Bertin 
2014; Binney and Tremaine 2008; Spitzer 1987), are the King models (King 1966) 


5 i ee 
ik = oan (e* ge Joe a3 (12.80) 
and the Wilson models (Wilson 1975) 
pl fo _& 7 
fw= Ono237 (<* eo 52 O(E — &). (12.81) 


Anisotropic spherical models can be obtained by modifying the King/Wilson models. For 
example, the Michie models (Michie 1963) are obtained for 


2 


fu = fx(€) x0 28? (12.82) 


Finally, special mention is due to the fo models (e.g., see Bertin 2014; Bertin and Stiavelli 
1984; Stiavelli and Bertin 1985, 1987; see also Merritt et al. 1989 for the “negative temper- 
ature” version of these models), associated with a DF that in the spherical limit becomes 


foo = AE3e%F I" (8), (12.83) 
and to the f”) models (e.g., see Bertin 2014; Bertin and Trenti 2003; Stiavelli and Bertin 
1987) 


yp v/2 
fe Act’-4( err) a(é). (12.84) 
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These two last families are constructed on the basis of statistical mechanics arguments, 
and the associated density distributions, obtained by numerical solution of the associated 
Poisson equations, reproduce remarkably well the density (light) distribution of elliptical 
galaxies (see also Section 6.2). Other self-consistent models built to describe galactic disks 
coupled self-consistently with a dark matter halo or considering the effects of the galactic 
tidal field and internal rotation, which have been particularly successful in the modeling 
of globular clusters, have been recently proposed (Amorisco and Bertin 2010; Bertin and 
Varri 2008; Varri and Bertin 2009, 2012, and references therein). Unfortunately (but not 
unexpectedly), for all of these models, only numerical solutions of the Poisson equations 
can be found. 


12.3 Two-Integral Axisymmetric Systems 


We now move to consider the case of axisymmetric stellar systems, so that the total poten- 
tial can be written in cylindrical coordinates (R,g,z) as Vp = Wy(R,z), independent of 
the azimuthal angle g. We study the natural case of a DF depending on the two classical 
integrals of motion related to the assumed symmetry (i.e., J) = € = Wr — |lv||7/2) and 
In = J, (the component of the angular momentum along the z-axis), with 


f =h€E, Jz)0(E). (12.85) 


In complete generality, let f+ be the even (odd) component of the DF with respect 
to Jz, i.e., 


FE, J,) = FE, = Je) 


SLE, J.) _ 2 (12.86) 
We now prove that in these systems 
a fet RJ2(H7-E) 
= | az [ ha (E, Jz)d Jz, (12.87) 
UR = Vz = URVz = URVg = VzVy = 0, TR =P, (12.88) 
An Wr R/2(W7—-E) 
puvg = = | az [ h_(E, Jz) Jzd Jz. (12.89) 
Moreover 
_ o RJ207—E) 
pv2 = aia e | WET) I dy, (12.90) 


and with the introduction of the meridional velocity dispersion raed 


An pr RJ2(0—E) # 
=> az [ hs (€. de) | 20by = €) — = | ad. (12.91) 
0 0 


= 02 +07 =207 


Bw 


po 
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In practice, in a two-integral axisymmetric system, ordered rotation can be present only 
around the z-axis, and only the even part of the DF contributes to the density. Furthermore, 
p = p(R, Vr), obtained from Eq. (12.87), depends explicitly both on R and Wy. The only 
admissible streaming motion is along the azimuthal direction, and it is determined uniquely 
by the odd component of the DF. The velocity dispersion ellipsoids are aligned with the 
local basis (f ee and are cee: symmetric around the azimuthal direction fy, 
with oO; = =u = Ue and o% R= of = =o, af 2. Thus, the systems described here can be either 
‘ono or svipmutally anisotropic. In the isotropic case, the velocity dispersion ellipsoid 
is spherical and the deviation from spherical symmetry of the system is due to the ordered 
rotation field ug, in analogy with rotating fluid systems. By contrast, when vg = 0, all of the 
deviations from spherical symmetry are supported by the tangential velocity dispersion oy. 
In the next chapter, we will discuss these important aspects in some detail. Finally, note that 
as a spherical system is also axisymmetric, in principle a collisionless stellar system could 
have some net rotation (Lynden-Bell 1960). Of course, this is impossible in the standard 
cases encountered in fluid dynamics, as the tensor pressure is isotropic (see Chapter 10). 

From Eq. (12.8), Q, = {(x,v) € vy : Ur—- I|v||7/2 > O}, and so the velocity sections Qxx 
are spheres of radius ./2W7(x). In this case, the natural coordinates in configuration space 
are the cylindrical ones (R,@,z) and the associated velocity components are (vp, Ug, Uz). 
Note that in the . vp and Ys appear only in the combination v2, = ve + i, because 


m 


Jz = Rug, while v? = v2, + a; , where v,, is the so-called meridional velocity. As a direct 


consequence of the geometry ‘of Qxx, the natural coordinates for integration in velocity 
space are the spherical ones, with 


VR =USINA Sinfl, Vz =USINA COS, Ug = VCOSA, (12.92) 


with 0 < v = |lvl| < /2W7,0 < A < mz, and0 < w < 27 (see Figure 12.3), so that 
Um = vsindA and f is independent of jz; finally (prove it!) 


d>v = durdvzdvg = v* sind da dy. (12.93) 


Integration of jz and reduction of the integration over A from (0,77) to (0,7 /2), taking into 
account the parity of f4 and f_, leads to 


J2UT n/2 
p= ax [ Pav f h, sina da, (12.94) 
0 
aa 
pUg = 41 / ‘af h_sinkd cosa di, (12.95) 
0 0 
= 2Wy 
pus = 4n / utdv fo 4 sind (cosa)? da, (12.96) 
0 0 


2 
po. =4n / utd << h,(sina)? dd. (12.97) 
0 


So 
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wv 


Figure 12.3 The decomposition of the velocity vector v adopted to integrate the DF in the two- 
integral case for cylindrically symmetric systems (Section 12.3). Due to the degeneracy of the velocity 
components in the meridional plane, the natural coordinates are the spherical ones, with the colatitude 
A measured from fy and the longitude jz measured from f, = e; in the plane (f;,fp). 


Finally, by changing the coordinates from (v, A) to (€, Jz) 
dédJ, 
RVI = EY 20 — €) = J2/R? 


and so Egs. (12.87)-(12.91) are recovered, because 0 < € < Wy7,0 < J, < RV2(Wy7 — €). 


dvdi = 


(12.98) 


12.3.1 Fricke’s DF 


A particularly simple and useful specialization of the general identities just derived is 
obtained by considering the special family 


ff, = AlJ_°2E80(E), A>O, a> 1/2, Bo-l. (12.99) 


Jz 
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This choice determines the so-called Fricke models (Fricke 1952). From Eq. (12.87), one 
obtains immediately 


An2%*9/*B(a + 3/2, 6 + 1) 
a 2a + 1 

where B(x, y) is the complete Euler beta function in Eq. (A.53). Note that for regular 
solutions and aw > 0 the density vanishes along the symmetry axis, so that the shape of the 
density distribution is qualitatively “toroidal.” Note also how the density is a combination 
of the powers of radius and potential, in striking similarity to the dependence of the DF on 
angular momentum and energy. The recovery of the analogous expressions for oo and o¢ 
is left to the reader. The relevance of Fricke models, beyond their mathematical simplicity, 
will be briefly addressed in Chapter 14. 


R22 WATE 26 Cr), (12.100) 


12.3.2 A Special Case of Two-Integral Axisymmetric Systems 


These systems can be considered as the axisymmetric generalizations of spherical Cudde- 
ford systems. Let 
fp 
f= fe =lsPh(Q)0(Q), Q=E- Te a> —1/2, (12.101) 
a 
where R, is a free parameter called the anisotropy radius. Note that for Rg — oo, fy 
can also be interpreted as a generalization of Fricke models with respect to the energy 
dependence. No streaming rotation is present. 


We now show that for these systems 


w 
p= 4ra(R.a) | "2b — 9)|**'"n(Q)d9Q, (12.102) 
0 
Uy 
po, = 4rB(R.a) | [2(Wy — O)]°*?/2n(O)dO, (12.103) 
0 
Wr 
poz = arc(R.a) | [2(Wp — Q)]°t7/27n(O)dQ, (12.104) 
0 
where 
Re 
A(R,a) = Ga FD RE RFT" (12.105) 
Re 
B(R,a) = Gar FE RRS" (12.106) 
2A(R,a) 
C(R,a) = (12.107) 


2a+3° 
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The natural coordinates in configuration space are the cylindrical ones. In fact, Q, = 
{&v) € y : Wr |Ivll?/2 — J2/2R2 = Wr — v7,/2 — vg(1 + R?/Rz)/2 = 0}, so that 
the velocity sections (xx are rotation ellipsoids around the azimuthal direction eg. With f 
being an even function in J,, no streaming motions can be present (vg = 0), and the density 
flattening is due to azimuthal velocity dispersion oy only. As a direct consequence of the 
geometry of Qxx, the natural coordinates for integration in velocity space are the elliptical 
ones, i.e., 


vcosxr 
JI + R2/R2 


with 0 < v < /2W7,0 < 4 < mz, and 0 < w < 27. Note that v,, = vsind and that h 
depends only on v; moreover (prove it!) 


VR =USINA COS, Vz=VSINA SiN, Vg_g = (12.108) 


3 v* sind 
a-yv = ——————- dd du. (12.109) 
J/1+ R?/R2 
Integration over jz and reduction of the integration interval of A from (0,7) to (0,2/2) 
leads to 


An R2 S207 m/2 
pe — / hv**? dy / sin A (cos A)?” da, (12.110) 
(Lop RA RA)OTN? Io 0 
A RX S27 m/2 
jo, = - ee i hv4dy i, sind (cosa)"*t? da, (12.111) 
(1 RARE) TS Jo 0 
A RX S27 m/2 
por = : —— if hv-+4dy / (sin A)? (cosa)?% da. (12.112) 
(1+ R2/R2)¢+1/2 Jo 0 


Finally, integration over A by using Eqs. (A.53)—(A.54) and the change of variable 


dQ 
dv = -—=—., 0< O< VW, (12.193) 


V2 — Q) 
completes the proof. Note that for convergence a > —1/2. 
For this family of models 
2 2 2 
oO R* — 2aR; 
A(R.) =1-—8 “! 


= 12.114 
2/2 +R eer 


(i.e., in these systems, the distribution of anisotropy is independent of z and of the specific 
form of /(Q)). The interpretation of anisotropy for increasing R and for various a > —1/2 
is left to the reader. As for the simpler case of Fricke models and for these models when 
a > 0 and h is sufficiently regular, the associated density distribution is of toroidal shape, 
with p vanishing on the symmetry axis. 
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Exercises 


Let us consider a system with f = f(E)O0(E — &) truncated at relative energies 
below some value €;. Show that Eqs. (12.15)—(12.17) still hold with a lower limit of 
integration €; instead of 0, and conclude that for given Wry a spatial truncation of the 
density will occur on the surface Yr(x;) = €. If no energy truncation is imposed 
on the DF, then & = —oo. Hint: See Footnote 3 of this chapter. 

From the identity v = vyey + vyey + vze; = u;-f, + vefy + vf, show by direct 
evaluation of the Jacobian that 


d?v = duxdvydvz = du,duydvg. (12.115) 


Moreover, show by direct evaluation that from J = x A v one obtains 
J =| =rv, vp =sfv5 + v2. (12.116) 
Hint: See Exercise 9.2. 


For a generic spherical, anisotropic stellar system with f = f(€,J7), by using 
Eqs. (12.25), (12.27), and (12.28), define the radial and tangential “pressures,” 
respectively, as 


P(r, Vr) = poz, pir, Vr) = por. (12.117) 
Prove that, independently of the specific Wy(r) radial profile, the remarkable 
identities 
OPr Pt lolnp; 
= : sl = ; 12.118 
p(r) Ww Br) op, > Ine ( ) 


hold (e.g., see Baes and van Hese 2007; Bertin et al. 1994; Dejonghe 1986; 
Dejonghe and Merritt 1992; Spies and Nelson 1974), where the anisotropy profile 
B(r) is defined in Eq. (12.34). 

This exercise introduces the differential energy distribution, defined so that the mass 
(or number) of stars with energy in the range (€,€ + dE) is given by dM(E) = 
N(€)dE. Show that for an isotropic, spherical system in a total potential Wy, we 
have 


r(€) 
N(O)=FORE), gO= 16x? [ r?,/2(Wr — E)dr, (12.119) 
0) 


where W7[r(€)] = E (for interesting properties of this function, see Binney 1982b; 
Binney and Tremaine 2008; Ciotti 1991). Hints: In the expression for the total mass 
M = 4r ie r2p(r)dr, insert the expression for p in Eq. (12.15) and invert the 
order of integration in the plane (7,€). 
The family of tangentially anisotropic OM models is obtained from Eq. (12.41) with 
JZ 
Q=E+-—. (12.120) 
r; 


a 


12.6 
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Show that 


r2 


Bir) == 


: 12.121 
ap ne 


Note that the velocity dispersion tensor is still isotropic in the center, but now 
is increasingly tangentially anisotropic with increasing radius. Note also that, at 
variance with the radially anisotropic case, Eq. (12.121) can be applied only to 
spatially truncated systems, provided that rag > 7, where 7; is the truncation radius 
(.e., o(r) = O for r > 7). Because of this limitation, these models are not of 
much use. 
Consider the DF of the isothermal sphere in Eq. (12.79). By integration over the 
velocity space, show that 

p= wer a2 =o", (12.122) 


UL 


and then recast the Poisson equation (12.72) in terms of the density as 


1 d ( ,d\np 4nrG 
= : 12.123 
r2 dr (" dr ) o ( ) 
Finally, use p = p,(r/r,)-® as a trial solution and prove by substitution into 


Eq. (12.123) that a self-gravitating singular solution exists for a = 2 and p iv? = 
o2 /(2xG), coincident with the singular isothermal sphere in Eqs. (2.63)—(2.64) for 
ve = 2c”. Of course, a regular solution of Eq. (12.123) also exists, but it cannot 
be expressed in terms of elementary functions (e.g., see Binney and Tremaine 2008; 
Chandrasekhar 1939). 

Show that the DF of the isothermal sphere in Eq. (12.79) can be obtained as the limit 
for n — o of the DF of the polytropic sphere (12.69), once Eq. (12.78) is used. 
Hints: Write A, in terms of B, from Eq. (12.70) 


AP rat) 
" Qn)3/202" Pa = 1/2)(n + 


and fix € as in Eq. (12.78). Compute the limit of the resulting Eq. (12.69) for n > 
oo by using Stirling expansion in Eq. (A.47) and show that Eq. (12.79) is recovered, 
with p, = poe Yo/ Bo and o = op. Asa consistency check, also compute the limits 
for n — oo of the density and velocity dispersions in Eqs. (12.70) and (12.71) and 
reobtain Eq. (12.122). 

Consider the King DF in Eq. (12.80). Integrate Eq. (12.15) and show that in the 
self-consistent case 


je oe .* Erf(v6) 2 (: *)| 4(), (12.125) 
Weis 3 


where € = E,/o?, >= W/o? — €, and Erf is the error function in Eq. (A.57). 
Obtain the analogous expression for the velocity dispersion. 


(12.124) 
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12.9 Show that, with the change of coordinates in Eq. (12.92) suggested by Figure 12.3, 
Eq. (12.93) holds, and from computation of the Jacobian, prove Eq. (12.98). 


12.10 By using Eqs. (9.41) and (9.42), show that from the identity v = v,»e, + vyey + 
Uz; = U;f + vefs + vyfy, the Cartesian and spherical phase-space velocity com- 


ponents are related as 
Vy = Up SIND COS P + Vy COS F COS Y — Ug Sin g, 
Vy = vy Sind sing + vy cos F sing + Ug COS gy, (12.126) 
Uz = Up COS BD — vg sind. 


Then consider a spherical stellar system with f = f(€,J7), recall that a. = Cr 


and from averaging over the velocity space deduce that 


Vy = Vy = v; = 0, 


a2, = 0, sin? 3 cos” y + 04 (cos? 9 cos’ y + sin’ g), 

a = o sin? 9 sin? g + 0, (cos” 0 sin? y + cos? 9), 

02, = 0, cos” 3 + of sin” 9, (12.127) 
C= (o? — a9) sin’ 0 sing cos, 


= 2) «3 
Os, = (07 -— Og) Sin V Cos ¥ Cos g, 


2 ~(g2_ ah : 
Oy. = (0; oo) sin} cos } sing. 


12.11 By using Eqs. (9.41)-(9.43), show that from the identity v = vye, + vyey + ve; = 
urfr + vofy + vf, the Cartesian and cylindrical phase-space velocity components 


are related as 
Vy = VRCOSY — Ug SING, Vy = VURSING+ VgCOSY, Vz = Vz. (12.128) 


Then consider an axisymmetric stellar system with f = f(€, Jz), recall that ae = 
a and from averaging over the velocity space deduce that 


Vy = —Vgsing, Vy =VUgcosgy, vVz,=0, 


Bo ee ae ad 2 ed 
Oxy = ORCOS Y + OG SIN” &, 


ay = oe sin? g + o5 cos? ¢, 
5 ‘ (12.129) 
07, = OR, 
2 
a= (oR — 04) sin g Cos g, 


Co. = 0. 
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Show that the spherical and cylindrical phase-space velocity components of v = 
orf, + vofy + vgfy = urfr + vofy + vzf; are related as 


Up = URSINV + vVzCOSU, Vy = URCOSY — vz SIND, Ug = Ug. (12.130) 


Consider again an axisymmetric stellar system with f = f(€, J) and show that 


Or = O%5 = oR =o} 

: : (12.131) 
Cop = Cp: 

a rr: 
O79 = O79 = 95, = 9 


An important model that is often used to describe self-gravitating planar distribu- 
tions (both in stellar dynamics and in fluid dynamics) is the so-called isothermal 
sheet (Bertin 2014; Binney and Tremaine 2008; Spitzer 1942). It is obtained (see 
the precautionary note in Chapter 1!) by assuming a density distribution stratified 
on z and independent of x and y, with the DF (12.79) and € = (z) — v?/2. First, 
show that the density can be written as in Eq. (12.122) and that the Poisson equation 


for 6 = —W in the self-gravitating case reads 
a d 
ad —4AnGpie9/", (0) =O, o@) = 0, (12.132) 
dz dz z=0 


where the two conditions at z = 0 are fixed so that the density value p; in the 
plane is given as a boundary condition and the vanishing of the derivative imposes 
regularity. Integrate Eq. (12.132) or show by substitution that the self-gravitating 
solution is 


p(z) =p cosh~2 (ee) : b(z) = —o2 In — (12.133) 
oO 1 


An interesting family of power-law DFs discussed in Ciotti et al. (2004) is obtained 
from the models in Section 12.3.2, assuming a power-law as a special form of the 
function h (1.e., h(Q) = AQP; see also Rowley 1988), so that 
f = fy = Ald)" 0"0(0), a >-1/2, B>-l. (12.134) 
From Eq. (12.102), show that 
_ An2*t5/? Ba + 3/2, 8 + 1) 


2a, ¢+B-+3/2 
P= Oa FI + R2/Raeria Rr 4(Wr), (12.135) 


and obtain the analogous expressions for om and oy. What happens for B = n 
positive integers? Hint: Expand f with the binomial theorem and consider a sum of 
Fricke DFs. 


13 
Modeling Techniques 2 


Moments Approach 


In this chapter, we discuss the complementary approach to that presented in Chapter 12 
for the construction of stationary, multicomponent collisionless stellar systems. The Abel 
inversion theorem is introduced, and then a selection of density—potential pairs of spherical, 
axisymmetric, and triaxial shapes commonly used in modeling/observational works are 
presented. We finally discuss the solution of the Jeans equations for spherical and axisym- 
metric systems, and among other things we show how to compute the various quantities 
entering the virial theorem. For illustrative purposes, we use some of the derived results 
to investigate the possible physical interpretations of the fundamental plane of elliptical 
galaxies. 


13.1 The Construction of a Galaxy Model: Starting 
with the Jeans Equations 


One difficulty with the f-to-o approach illustrated in Chapter 12 is that there is little 
control over the resulting profiles of density and over the other dynamical quantities, even 
though when available these solutions are of great interest because they represent physically 
acceptable equilibrium models (which are not necessarily stable) of stationary, collisionless 
stellar systems. 

An alternative (and, when possible, complementary) approach to the construction of 
models describing stellar systems, based on the Jeans equations (see Chapter 10), is very 
useful for its ability to relate observationally accessible quantities, such as the streaming 
velocity and velocity dispersion, to other interesting but directly unaccessible quantities 
(e.g., the potential and the total density field of a given galaxy); we refer to this approach 
as the p-to- f approach. However, before proceeding with the illustration of the method, 
the student should fully appreciate that in the p-to- f approach, in general the underlying 
distribution function (DF) cannot be determined uniquely, and even when it is (under 
quite restrictive assumptions), its numerical recovery is an example of an “ill-conditioned” 
problem. In fact, as discussed in Chapter 10, the macroscopic properties of a stellar system 
described by the Jeans equations are moments over the velocity space of the DF of the 
system, and from a mathematical point of view, physically acceptable moments can also be 
originated by an unphysical (i.e., somewhere negative) DF. Thus, in general, even though 
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the approach described in this chapter is easier to apply than the f-to-o approach, the 
validity of the results obtained is not guaranteed a priori. Fortunately, the situation is not 
desperate; in fact, under special circumstances, it is possible to obtain sufficient information 
about the DF associated with the system and to test its properties, which is the subject of 
Chapter 14. 

Qualitatively, we can summarize the main logical steps of the p-to-f approach as 
follows: 


(1) Guided by theoretical/numerical/observational indications, a specific form for the 
density p (or for the projected density ©) of the component of interest is assumed. 
Different density components can be allowed in order to simulate the presence of a dark 
matter halo, of a central supermassive black hole (BH) or of other stellar components 
with different ages, chemical compositions, mass-to-light ratios, and so on. In general, 
the mass-to-light ratio Y,. of each density component is assumed to be independent of 
position, but its numerical value can be different for each density component. 

(2) For each density component, the associated Jeans equations are solved either analyti- 
cally or, more often, numerically, and their solutions are projected using the techniques 
presented in Chapter 11. The closure of the Jeans equations is in general obtained by 
imposing specific ansatz (hopefully physically motivated), such as with the task of 
reproducing the main features of observed (projected) velocity fields. 

(3) When possible, the underlying DF is recovered and its positivity is investigated. In case 
of negative values, the DF cannot be accepted, and then the model obtained must be 
discarded, even if it is successful at reproducing observations. The recovery of the DF 
is usually the least trivial step of the p-to- f approach and, when ignored, can leave the 
physical significance of the entire process under serious concern. 


In this chapter, we focus on Points (1) and (2) above, restricting ourselves for simplic- 
ity to some of the most common spherical and axisymmetric models. Point (3) will be 
discussed in Chapter 14. 


13.2 The Choice of the Density Distribution 
13.2.1 Spherical Models 


Spherically symmetric models (single-component or multicomponent) are the simplest 
models used in stellar dynamics to describe stellar systems. They are usually sufficiently 
simple to allow an almost complete analytical study, and so they can be used as useful 
starting points before embarking on time-consuming investigations based on more realistic 
models. In general, the first step in the construction of a model is the choice of a density 
distribution reproducing the relevant properties of the projected/spatial density (luminosity) 
profile of the system of interest, so that the relation between spatial and projected density 
for spherical systems needs to be clarified. This is accomplished by using a very general 
result that comes from Abel (1802-1829). Let g;(x) and g2(x) be two sufficiently regular 
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functions defined over the interval x» < x < xm. For 0 < a < 1, the following 
identities hold: 


_ p* gid _sin@re) d f* fi(ndt 
fix) = a 81) = — OEE (13.1) 
M go(t)d d ¢™ fy(t)d 
fis) = f haan g2(x) = “ne =| rani (13.2) 


and the verification of the inversion formulae above is given in Exercise 13.1. This impor- 
tant theorem has innumerable applications in mathematics and physics (e.g., see Gorenflo 
and Vessella 1991; chapter 3 of Landau and Lifshitz 1969), and even restricting ourselves 
just to astronomy (e.g., see Ciotti et al. 1995; see also Chapter 14), this is vital not only 
in stellar dynamics, but also in areas such as the modeling of hot gaseous atmospheres of 
elliptical galaxies and clusters of galaxies (e.g., see Ciotti and Pellegrini 2008). 

Let us now consider a spherically symmetric stellar system described by a mass density 
p(r) stratified on the spherical radius 7; we indicate with X(R) the projected mass density, 
where R is the radius on the projection plane (see Chapter 11), and for simplicity, in the 
following, we only consider the case of a spatially constant mass-to-light ratio Y,,, so that 
p(r) and &(R) are related to the light density v(r) and to the surface brightness profile 
I(R) by the elementary p = Y,v and © = Y,/, respectively.! Abel’s theorem can be used 
to relate o and & (and v and /) as follows: assume 


A=prjan—r), Ut = UCR/E(R — R), (13.3) 


where 7, = R; is the so-called truncation radius, and when 7 = Rt = ov, the system 
is said to be (spatially) nontruncated. From Figure 13.1, simple geometric considerations 
show that in a transparent system (i.e., a system where no two stars are exactly aligned 
along the line of sight (Jos); see also Exercise 11.2) 


srya2 feet pe 13.4 
(R) = [ Jfa t=lt (13.4) 


and from Exercise 13.2 
1 f d&X(R) dR =x(R) 
p(r)= aR = . 
r WR =F r/R? — 2 


(13.5) 


A check can obtained by inserting Eq. (13.5) into Eq. (13.4), inverting the order of inte- 
gration, and finally using the Euler integral (13.133) with a = 1/2. Note that even if 
p(t) > 0 (in practice, the system is spatially truncated with a finite “jump” at 7), neces- 
sarily U(R,) = O. It is then not a surprise that if &(R,) > O, then the associated spatial 
density must be (weakly) divergent at the boundary of the system, p(r) ~ @ =g)\7 


| In the case of stellar systems with a spatially dependent mass-to-light ratio Y,,(x), such as multicomponent systems with 
different stellar populations, caution is needed when relating intrinsic and projected density/luminosity fields (e.g., see 
Cappellari 2002; Caravita et al. 2021). 
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Figure 13.1 Geometry of the projection of a transparent spherical system. The Jos is directed along 
the z-axis, and sor? = R* +z”, where R? = x7 + y’. By definition of projection, U(R) = 


4/ R2—R2 
216 : p(V R2 +22)dz, so that Eq. (13.4) is obtained with a change of the integration 
coordinate from z to r, at fixed R. Integration along the horizontal line S defines the strip brightness 
S(x) in Eq. (13.15). 


the student is invited to explore the behavior of &(R) near R; for p(r) ~ (r2 —r?)“ asa 
function of a. 

Before listing some of the most used spherical models that are encountered in the 
literature, we recall the relevant quantities that characterize a spherical density profile. The 
projected mass Mp inside a circle of radius R on the projection plane of a system with 
surface density & is 


R 
My(R) = 2x f u(t)t dt, (13.6) 
0 


and an identical expression relates the projected luminosity Lp and the surface brightness 
profile J when considering the spatial luminosity density v instead of p. 
When Lp(R;) is finite, the effective radius Re of the system is defined as 


Lp(R) 
2 


(i.e., Re is the radius of a circle in the projection plane containing half of the total luminosity 
of the density component under investigation). It is important to recall that real galaxies, 


Lhe) = (13.7) 
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when observed in different spectral bands, are usually characterized by different values of 
Lp(R:) and Re, the latter being a clear indication of population/metallicity/age gradients 
inside the system. When / (0) is finite, another important scale length of the system, the 
core radius R., is defined as 

I(0) 


Finally, we recall that the mass contained inside a sphere of radius r is given by 


M(r) =4n [ p(t)t? dt, (13.9) 
0 


and an identical formula relates the spatial luminosity L(r) and the light distribution v(r). 
When M (1) is finite, the spatial half-mass (light) radius ry is given by 


M(mn) = es (13.10) 


From Egs. (13.4)—(13.6), it is easy to show that Mp(R,) = M(7) and thus to prove, in 
this explicit case, the validity of Eq. (11.7). A very simple geometric argument shows that 
Re < rp holds independently of the specific form of o(r) (simply imagine the surface 
integral of the projected density inside the radius R as given by the sum of three volume 
integrals: a sphere of radius R and the two remaining parts of the projection cylinder once 
of the sphere is removed, then integrate up to R = rp and deduce the statement; see also 
Exercise 13.3 for an explicit example). It should be fairly obvious that in the case of a 
spatially constant Y., the quantities Re, Rc, and rp, can be defined indifferently over the 
mass or over the light profile, being Mp = Y,.Lp. 

In the construction of a model for a stellar system, potential and gravitational energies 
are quantities of fundamental interest. We have already encountered them in general cases 
in Chapters 2 and 10, and here, in preparation for the following discussion, we give their 
expressions for spherical systems in presence of an external potential ¢ex:(7). The gravita- 
tional potential produced by the density distribution in Eq. (13.3) is obtained from Eq. (2.5) 
by considering r9 — ox, so that 


GM(r) n 
—— +4nG p(r)rdr, OK<rK<nr, 
o(r) =— ee) r (13.11) 
aa r = Ts 
7 


and from Eqs. (10.49)-(10.54) the associated self-gravitational energy can be written as 


U=2n [ p(r)o(r)r-dr = —an6 [| M(r)p(r)rdr = W, (13.12) 
0 0 


where the last identity derives from Eq. (10.54). If, in addition, a spherically symmetric 
external potential ¢ext (7) is present (e.g., that of a dark matter halo or of a central BH), then 
an integration by parts of Eq. (10.50) with respect to the enclosed mass shows that 
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. 2  M(r)Mex(r) 
Us =a | pirrdbeulryrdr = Minder) — 6 f° RPE ar, 13.13) 
0 0 


while the interaction energy entering the virial theorem is obtained from Eq. (10.52) as 
" 
Wext = ~4n6 | Mex(r)e(r)rdr. (13.14) 
0 


In spherical systems, a quite remarkable identity relates the projected mass (luminosity) dis- 
tribution with the potential and self-gravitational energy. Following Schwarzschild (1954), 
we the introduce the concept of strip brightness. Let I(R) be the surface brightness profile 
associated with the light distribution v(r), where p = Y,v, R = \/x?+4+ y?, and Y, is 
constant. From Figure 13.1, the strip brightness S(x) is defined as the integral of J along y 
at distance x from R = 0, and in Exercise 13.4 we show that the following relations hold: 


R2—x2 Rt I (R)R rt 
1S 2f I(R)dy =2 ‘ yer® - an | v(r)rdr, (13.15) 
1 dS(r) e 
v(r) = ——— , M(r)=2x% | S(x)dx —r sw] , (13.16) 
2nr dr 0 
R, 
U= -2673 | S?(x)dx. (13.17) 
0 


Note that S(R,) = S(%) = 0, so that for r > 7; the second identity of Eq. (13.16) gives 
M(r,). Moreover 


or) =- 


2GY, [" RX 

. [ S(x)dx, (0) = 46%, | I(R)dR (13.18) 
(a result apparently unnoticed in Schwarzschild 1954; see Ciotti 1991 and Exercise 13.5; 
see also exercise 2.15 in Binney and Tremaine 2008). Finally, a quite interesting expression 
of U in terms of &(R) can be obtained with some work from Eq. (13.17), as is shown in 
Exercise 13.8. The interesting point of these results is that in some cases they allow us 
to obtain explicit expressions for dynamical quantities in systems defined by the surface 
density profile, when Abel inversion cannot be performed in terms of elementary functions. 


13.2.1.1 A Selection of Spherical Models of Common Use 


Here, we list some of the most used density and surface brightness profiles of spherically 
symmetric stellar systems. These models are relatively simple, yet they can represent rea- 
sonably well the density profiles of (early-type) galaxies or other spheroidal stellar systems. 
Of course, the following must not be considered by any means complete, and the student 
is strongly encouraged to dedicate some time to the exploration of the literature. We begin 
with models defined by &(R) or 7(R). 


¢ Sersic R'/" law (Sersic 1968) 


1(R) =1O)e2""— gp = (13.19) 
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where R- is the effective radius and b(m) is determined below. It has no free parameters 
and depends on two physical scales, the effective radius R. and the central surface 
brightness /(0). For m = 4, it reduces to the famous de Vaucouleurs (1948) R!/4 Jaw, 
with b(4) ~ 7.67. For m = 1, the formula becomes the exponential profile adopted 
to model the face-on surface brightness of disk galaxies. Due to the relevance of Sersic 
law in the observational works of early-type galaxies, some additional information is 
presented. From Eq. (13.6), the projected and total luminosities are given by 


22mm 
© p2m 


42mm 
e p2m 


Lp(R) = 1(0)R y(2m,by!/"), L=1(0)R T(2m), (13.20) 


where y(a,x) is the incomplete gamma function in Eq. (A.51). From Eq. (13.7), it 
follows that b(m) is the solution of the transcendental equation 
(2m) 
2 
and standard asymptotic analysis (e.g., Bender and Orszag 1978; Bleistein and Handels- 
man 1986; de Bruijn 1958) shows that for m — oo, 
=F : =f a ++ 
3 405m 25515m? 


(see Ciotti and Bertin 1999; see also Prugniel and Simien 1997). A slightly more difficult 
asymptotic analysis (Baes and Ciotti 2019a) shows that form — 0 


y (2m, b) = (13.21) 


b(m) ~ 2m (13.22) 


1 2 
b(m)2”" ~ mT (2m) = 5a ymt (= + ) Mm? +0 (13.23) 


where y = 0.577 ... is the Euler—-Mascheroni constant; therefore, b(m) ~ 1/ (/2)1/ m_s 
0 for m — 0. The luminosity density v is given by Eq. (13.5) and unfortunately it cannot 
be expressed in terms of elementary functions for generic values of m. However, it can 
be proved (Ciotti 1991) that for r — oo 


I 0 b m 
poy ~ (0) eos! (12m) /2m s= ca (13.24) 
Re. V 2am Re 
while for r > 0 
b"Td—m) 
———,, 0<m<l, 
TT 
I(0) bd) 2 
v(r) ~ —— x is "Bs m=\1, (13.25) 


B[1/2, (m = V)f2m) asim 0m) /m_ 


m> 1. 
2mb-1 


For m > 1, the density diverges at the origin as r'~””)/"", and therefore the divergence 
of p is worse for higher-m models: for m = 4, p diverges as r~*/* for r — 0. Several 
dynamical properties have been computed (e.g., see Baes and van Hese 2011; Binney 
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1982b; Ciotti and Lanzoni 1997; Graham and Driver 2005; Poveda et al. 1960; Young 

1976 and references therein). The central potential is obtained from Eq. (13.18) as 

40 (m + 1) 
pm 

and a quite surprising integral expression for U is finally given in Exercise 13.8. We 

conclude by mentioning the Einasto models (Einasto 1965; Graham et al. 2006 and 

references therein), the three-dimensional analogs of the Sersic law, which are found 


$(0) = —GY,1 (0) Re (13.26) 


to reproduce very well the structure of dark matter halos obtained from numerical simu- 
lations of structure formation. 
Hubble (1930)—Reynolds (1913) 
(0) 
(1+ R/Ry)?’ 


one of the first families of profiles used to describe the surface brightness profiles of 


Rs (13.27) 


elliptical galaxies. 
King (1962) 


. 1 

(R2+RR  /RP+ RR 
where the coefficient K can written in terms of 7(0). This family has been used to 
describe the surface brightness profiles of globular clusters. 

The following models are instead built starting from p(r), so that &(R) is a derived 
quantity. For a few we give also some additional formulae that can be useful in applica- 
tions, such as the relative potential V(r) = —@(r), while the deduction of other formulae 
(that can be found in the given references) is left to the student. We indicate with s = r/r¢ 
the spherical radius in units of the scale length r, of the density profile, and with n = R/r¢ 
the radius in the projection plane in units of rc. 

Power-law (untruncated, 0 < y < 3) 


1(R)=K 6(R, — R), (13.28) 


Pc MM. sy 3 
p(r) TS M(r) =". 3 M, = 4 pcr. (13.29) 
sy 3-y 
For mass convergence near the center y < 3 and for y = 2, the density profile is also 
known as the singular isothermal sphere (SIS; see also Exercise 2.1). The potential is 
obtained from Eq. (2.5) with y = oo and 79 = oo for2 < y < 3 or7p = O for 


O<y <2 
GM. ay 
g(r) = —— 
re B-y)2-Yy) 
For y > 1, the projected density and projected mass profiles are 


ly=1)\ 4 M, 1 pol) = 
X(R) = percB ( =, —— Y~ M)(R) = B{ =, eo. (13:31 
(R) = pele (; 5) )n p( ) 23 —y) (5 2 )n ( ) 


(13.30) 
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where B(x, y) is the complete beta function in Eq. (A.53). Note how & is shallower than 
p, a general consequence of projection. For other properties of these models, see also 
Exercise 3.1. 


Quasi-isothermal sphere (untruncated) 


p(0) 


p(r)= ea. M(r) = Amp (O)r3(s — arctans), (13.32) 
/ 2 
b(r) = 4 Gp(Oyr2 ( _ ah swan) (13.33) 
Ss 


where the potential is obtained from Eq. (2.5) with ro = 0 and imposing @(0) = 0. This 
model is characterized by a constant circular velocity at large radii, with the value given 
by (prove it!) v2 = 41Gpo (0)r2. As an exercise, compare the behavior of the potential 
for s — oo with that of the SIS in Eq. (2.64) and comment on the result (consider what 
happens to this model for r, — 0 at fixed vc). 


King (1972) (untruncated) 


p(0) 7 s 
p(r) = (2+ Ds? M(r) = 42rp(O)rz | arcsinhs — rae ; (13.34) 
, arcsinhs 
Wir) =421Ge(0)r; : (13.35) 
s 


Note that the total mass of the profile diverges; however, from the discussion in Chapter 
2, it follows that the potential can be fixed to zero at infinity (for additional properties of 
this model, see Exercise 3.2). 


Plummer (1911) 


M 1 Mir) Ms? 
—— a Pr) => 5a 
Anr3 (s2 4 1)9/2 (s2 + 1)3/2 


wr) GM 1 
r= ———-. 
reo Vs* +1 
This is one of the most important models encountered in stellar dynamics due to its 
numerous relevant properties (see Chapter 12 for the relation with the polytrope of index 
n = 5 and see Exercises 13.3, 13.17, 13.18, and 14.2). 


B-models (6 > 1) 


p(r) = (13.36) 


p(0) p(0)reB(1/2, B/2 — 1/2) 
7 (R= 4 DE-DE 


(13.37) 


The cumulative mass M(r) and the potential (7) can in general be expressed (do it!) 
for generic values of 6 in terms of the incomplete beta function in Eq. (A.55). Note that 
the quasi-isothermal sphere in Eq. (13.32), the King (1972) model in Eq. (13.34), the 
spherical limit of the perfect ellipsoid in Eq. (2.93), and the Plummer (1911) model in 
Eq. (13.36) can all be seen as special cases of B-models, obtained respectively for 
B = 2,3,4, and 5. 
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¢ y-models (0 < y < 3) (Dehnen 1993; Tremaine et al. 1994) 


Ot (3—y)M M(r) = Ms? = ro (13.38) 
Ow anrasts DRY TN GED BO , 
1 Ss | 
1 ( ) » V2 
M = 
#oj 364 1 C=Y) — (13.39) 
lc 1 
In saa » YH 2: 


s 
It is an interesting exercise (do it!) to show that at re — 0 the potential of the 
y-models reduces to that of a point of mass M. From Eq. (13.38), it follows that the 
density of y-models is a power-law r~Y at small radii (i.e., for r < rc) and a power-law 
r~* at large radii, and due to their simplicity and flexibility, these models are routinely 
used when modeling the density distribution of early-type galaxies. Special cases of 
great importance in theoretical and numerical works are the Jaffe (1983) and Hernquist 
(1990) models, obtained respectively for y = 2 and y = 1, with Re ~ 0.75 re and 
R. ~ 1.82r¢: these two profiles reproduce the R!/4 profile in projection remarkably 
well. Generalizations of these models can be found in Burkert (1995), Evans et al. (2015), 
and Zhao (1996). For other properties, see Exercices 3.3 and 13.6. 
Navarro—Frenk—White (NFW) profile (Navarro et al. 1997) 


M GM 1 1 
p(r) =—> » Me MD, wy = SEBS 3.40) 
Anrs f(c)s(s + 1) fc) lc sf(c) 
where M is the mass contained inside the radius r; and so 
Ss rt 
f(s) = In(s + 1) . t=: (13.41) 
stl re 


Note that the total mass of the profile diverges; however, from the discussion in Chapter 
2, it follows that the potential can be fixed to zero at infinity. This profile is found to 
reproduce the density radial trend of the cosmological dark matter halos obtained in 
numerical simulations. 

Hénon (1959) isochrone 


Wir) GoM d (13.42) 
r) = —_ —_ ==, Z 
re 1+vVJs2+1 


where M is the total mass, and from Newton’s second theorem 


Ms? ©) M © 2s? 4+3(14+ vs? +1) 
— ; p ; al it a 
Vs? + 1(1 + vs? + 1)? Amr (2 + 1)3/2(1 + Vs? + 1) 


This is a very important model (most easily introduced starting from the potential) due 


M(r) 


(13.43) 


to the fact that the period of orbits is independent of the orbital angular momentum and 
is only dependent on orbital energy (see Binney and Tremaine 2008 and, in particular, 
Binney 2014 for a very clear and complete discussion of the model). 
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13.2.2. Axisymmetric and Triaxial Ellipsoidal Models 


Along the lines adopted to introduce the main structural properties of spherical models, we 
now consider models departing from spherical symmetry, starting with the general family 
of triaxial and axisymmetric ellipsoidal models. In the axisymmetric case, the natural coor- 
dinates for these systems are the cylindrical ones (R, ¢, z), with all of the model properties 
independent of the azimuthal angle g. After ellipsoidal models, we will describe other 
families of nonspherical models of use in stellar dynamics, but the student is warned that 
many more families can be found in the literature. 

We already encountered ellipsoidal models in Chapter 2 (see in particular Section 2.2 
and related exercises). As we have seen, a very simple procedure that can be adopted in 
order to generate families of triaxial (or axisymmetric) models starting from a spherical 
density distribution p(r) = poP(r/ay) is based on the substitution (2.26). This is equivalent 
to the transformation of the spherical radius to 


z2 
re ae Sane RG RP, (13.44) 
Q 


where gy = az/a, and g, = a3/a, are the ratios of the semiaxes of the isodensity ellip- 
soidal surfaces. In general, for gy = q;, the model is axisymmetric around the x-axis, for 
qz = | around the y-axis, and for gy = | around the z-axis. The standard axisymmetric case 
is obtained for gy = 1, and qg, is responsible for the model axial flattening. For 0 < gq, < 1, 
the density is stratified on oblate ellipsoids, for gz = 1 on spheres, and for g, > 1 on 
prolate ellipsoids. Usually (but not always), we will follow the convention in Eq. (2.27) 
(i.e., the shortest axis of the ellipsoidal surfaces is along the z-axis, the intermediate axis 
along the y-axis, and the long axis along the x-axis). With this substitution, the density of 
our system is given by Eqs. (2.28) and (2.74), ie., 


- 1 2 2 
p(x) = PON a —m), m= —/x?+ a = (13.45) 
Dy Qz a Wy Az 


where m; fixes the truncation surface m = m;; for my = oo, the model is untruncated. 

In analogy with Section 13.2.1, we now consider the most important properties concern- 
ing the projection of the ellipsoidal density in Eq. (13.45). For the sake of generality, we 
do not require that the mass M of the model is finite, so that po is simply a density scale 
not necessarily equal to M/ (47a; ). The astrophysical relevance of the problem should 
be clear, as quite often early-type galaxies can be satisfactorily described as ellipsoidal 
bodies. Several treatments of the projection of ellipsoidal (transparent) stellar systems are 
available (see, among others, Binney 1985; Lanzoni and Ciotti 2003; Stark 1977, and, 
in particular, Franx 1988a,b for the relevant problem of the projection of Stackel stellar 
systems); however, as different conventions are adopted to define the projection angles, 
for consistency here we follow Chapter 11. To project the galaxy density on the plane of 
the sky (the projection plane z'), we start from the system So = (O;e1,€2,€3) with the 
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origin at the center of the density distribution and with the axes aligned with the ellipsoidal 
axes. In So, the generic position vector is written as kx = xje; = x1e; + x2e2 + x3e3. The 
los direction in So is indicated by the unitary vector n pointing toward the observer, and 
it is specified by using standard spherical angles 3 and g, as in Eq. (11.2). The observer 
system S’ = (O’;f,f2,f3) is nonrotating and O = O’ at all times. The position vector in 
S’ is € = &f; and n coincides with f3 (i.e., the observer’s “eye” is placed at infinity on the 
&3-axis). The projection plane zr’ then contains the origin, and it is perpendicular to n. This 
is given algebrically by 3 = 0, and it can be identified with the set of two-dimensional 
vectors €,; = (&,&). From Chapter 11, it follows that the orientation of S’ with respect 
to So is given by a 3—2-3 rotation; the first (0 < m < 27) and second (0 < 0 < z) 
rotation angles coincide with the spherical angles of n in So, and the resulting rotation 
matrix R = R3(~)R2(V7)R3(wW) is given by Eqs. (11.46)-(11.48), with x = Ré and 
n = ?f3. Note that the rotation R3(y) around f3 simply rotates the projected image in the 
plane z’, and so for the present purposes here we fix y = 0; in other words, the projected 
image is intrinsically determined only by the angles g and ?. 

The integral defining the density projection X(& | ) is given in Eq. (11.5) and, as already 
remarked, this definition assumes that the observer is at infinite distance from the system 
and that the system is “transparent” (i.e., all of the information along n is summed at & | in 
the projection plane; see Exercise 11.2). Moreover, according to Eq. (11.7), the integration 
of X over z’ equals the total mass (luminosity) of the projected system independently of 
the Jos direction. In order to proceed with the projection, it is convenient to introduce the 
diagonal matrix 


1 
= 6 1 0 0 
ay 
il 1 1 
ay ay qy I 
1 
0 0 = 0 0 3 
ae qz 
so that from Eq. (2.26) 
m? = (Dx,x) = (E&,é), E=R'DR, (13.47) 


where the matrix E depends on the projection angles and is apparently symmetric and 
definite positive. We then write € = &, + &3f3, where for the moment we suspend the 
convention €, = (&,&) and we assume instead &, = (1, &,0), so that Eq. (13.47) 
becomes 


m = we? + 2(Ff3,€ |) + (£§,,€,) 


2 
= (e + i) a an (13.48) 
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where (prove it!) 


a> = (Ef3,f3) = (Dn,n) 


1 in? ein? 29 
=i teas ee ed (13.49) 
ay dy qz 
and 
(Ef3,€ 1)? 
e = (E&,,6,)- "= = (HE ,,81) (13.50) 


is a positive definite* quadratic form. Notice that at fixed & , , from Eq. (13.48) m? attains 
the minimum value 7 at 


_ (Efs,& |) : 


5 (13.51) 


min __ 
z= = 
5 ? a 


In the last term in Eq. (13.50) we reverted to the convention in Chapter 11 of €; = (&1,&2). 
FH is the definite positive symmetric 2 x 2 matrix 


1 ( Ey — Ej, aE — ) 
: 


= = (13.52) 
a7 E>, — E13E23 a Ex — E}, 


and its entries can be easily computed for any choice of the /os orientation. Therefore, 
curves of constant € are concentric and coaxial ellipses in the projection plane. 

We can now proceed with the projection of the density distribution in Eq. (13.45). First, 
we express the density in the observer’s coordinates € by using Eq. (13.48) and by fixing 
& |; we indicate the new function as p(&). Then we consider the general expression of the 
projection operator in Eq. (11.5) and we evaluate the minimum (&3 ) and maximum (EF ) 
values of the integration variable setting m = m, in Eq. (13.48), obtaining 


5 = ie (13.53) 


Of course, for untruncated models, the integration domain is (—0oo, 00). We then split the 
integration domain in (§ ety and a gS ) and change the integration variable in the 
two integrals as t = a& — wer Recognizing that the two integrals coincide, we finally 


obtain 

’ PO <= 
ze.) = | p'(&)déy = —°_ 510), 

& dy z 

3 (13.54) 

S m2—€2 im 
Be sof" (VEE Ear =2 [POM am 

0 t me — &2 


2 From the positivity of m2 in Eq. (13.48), it follows that the determinant of the quadratic equation in &3 is negative, and this 
proves that >0. 
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Therefore, we proved that the projection of the triaxial ellipsoids in Eq. (13.45) produces 
surface densities stratified on ellipses € = const., and that the dimensionless function 
X(£) is the same as that appearing in the projection of the parent spherical case, as can be 
immediately recognized by considering the case gy = gz = | in Eq. (13.54). Incidentally, 
this also implies that the deprojection formula for the ellipsoidal case, recovering p(m) 
from &(€), can be again obtained from Abel’s theorem as in the spherical case, once the 
los direction and the two intrinsic flattenings are known (for more complicated cases of 
deprojection, see, among others, Binney and Gerhard 1996; Gerhard and Binney 1996). 

A few comments are in order. First, it is easy to check that in the triaxial case, and for 
Jos inclinations not aligned with the axes of the ellipsoid, the elliptical isodense € = const. 
in the projection plane is rotated with respect to the (f;,f2) axes, as the nondiagonal terms 
of the H matrix are in general different from zero. Second, being H symmetric, a rotation 
of the remaining angle w around the f3 axis, leading to a diagonal matrix H’, is always 
possible, so that the phenomenon known as isophotal twisting (i.e., the systematic rotation 
of the isophotal principal axes moving from one isodense to another; e.g., see Binney and 
Merrifield 1998) for the considered ellipsoids is impossibile. In Exercise 13.9, we show that 
in the axisymmetric case H is independent of g and already in diagonal form for wy = 0. 
Third, when H is in diagonal form, the diagonal elements of H’ are the two eigenvalues 
A+ > A_ of H, given by the two (positive) solutions of the characteristic polynomial 


d? — Tr(H)A + det(H) = 0. (13.55) 


Therefore, from Eq. (13.50), the semimajor and the semiminor axes of the isodensity ellipse 
£ = const. in the projection plane are (prove it!) a = €/,/A_ and b = ¢/,/h4, and so 
the (constant) axial ratio of the projected isodensities is given by g = b/a = J/A_/A4. 
Not only is isophotal twisting absent in our class of objects, but also the ellipticity of the 
isodenses is constant and depends only on the intrinsic flattenings and on the /os orientation. 
Moreover, the area encircled by the ellipse £ = const. is given by mab = m7/./h_A4, 
and as is well known from matrix algebra A_A+ = det(H), where from Eq. (13.52) 


: ; sin? 3 cos? gy + ven = = (13.56) 
—————. = 430 = Aj; qyqz : : 
Jdet(H) = a ae 


From Eq. (13.56), it is possible to recover without diagonalization, as a function of 
the Jos direction, the explicit expression for the circularized radius (R), defined as 
a(R)? = mab. Geometrically, (R) is the radius of a circle in the projection plane with the 
same area as an ellipse @, and so 


(R) = ajl./qyqzaa}. (13.57) 


Note that the expression under the square root is independent of a;. The encircled mass in 
the projected ellipse £ is then obtained as 


va 
Moo =f Ld, = 2a} poMplO), Nig = [ y(e’)e'de’. (13.58) 
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This formula is obtained by first rotating the coordinates in the projection plane so that 
H is in diagonal form, with d*&, = d& ‘., and then changing to polar coordinates &") = 
(t’ cos 6/./A_, ' sin 6/./A+). From Eqs. (13.54) and (13.58), the student should prove that 
for £ > oo, Mp(¢) — M. Moreover, in perfect analogy with the spherical case, we can 
define the effective ellipse €. as 


MC . I 
Mp(le) = 5. ie, Mp(le) = 5. (13.59) 


Of course, the value of ¢, is independent of q,, gz, and the Jos inclinantion and depends 
only on the dimensionless function p(m). Therefore, from Eq. (13.57), the circularized 
effective radius (Re) is given by 


(Re) = Re/qyqz0a}, (13.60) 


where R, is the effective radius of the parent spherical model. Note that the axisymmetric 
case gy = | is particularly interesting, as from Exercise 13.9 it is elementary to prove that 


q= cos? 3 +q2sin? ¥ = gaa, (Re) = ReJ/q, (13.61) 


so that g = | in the face-on view and q = q; in the edge-on view. 


13.2.3 Additional Families of Axisymmetric and Triaxial Models 


Of course, other relatively simple axisymmetric systems not described by ellipsoidal distri- 
butions are known. Among the models in significant use (this list is by no means complete), 
we may recall 


¢ Miyamoto—Nagai models (Miyamoto and Nagai 1975; Nagai and Miyamoto 1976) 


ey eat ee (13.62) 
R2+ C2 
2 2 2 2\ 2 
ee Mb? aR? + (€ +2V 2° + b-)e (13.63) 


An (R2 ae €2)5/2(z2 + b2)3/2 : 
with a characteristic disk-like shape (e.g., Binney and Tremaine 2008). From inspection 
of Eq. (13.62), notice how these models reduce for a = 0 to the Plummer sphere and for 
b = 0 to the Kuzmin razor-thin disk in Eq. (13.69). 

¢ Satoh models (Satoh 1980) 


W(R,z) = =. C= (Rr? +22+a(a+2V 22 +b), (13.64) 
Mab? 1 3 R24 22 
p(R,z) = int A BD | ass + (1 72 I. (13.65) 


another family of disk-like stellar systems. 
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¢ Binney’s logarithmic potential (Binney 1981; Binney and Tremaine 2008) 


2 pe igh 
Mos mle 2), (13.66) 
2 a2 
2 1 2 2 42 R2 a | D\ ot 
o(R,z) = Uso + 2q°)ae + + ( /q°)z (13.67) 
4nG qz(a* + R? + 27/q*)? 


where a is a scale length and vo is the asymptotic (constant) circular velocity in the 
equatorial plane (prove it!), and for positivity of the density g < 1/ ./2. For values of 
q near the positivity limit, the density distribution presents a characteristic torus-like 
structure. 


Evans’ power-law models (Evans 1994; Evans and de Zeeuw 1994) 


R2 27,2\ —B/2 
att) (13.68) 
a 


WR) = ¥O.0 (14 : 


where a is a scale length, and 6 ¥ 0, and the associated density distribution can be easily 
calculated. In fact, the Laplacian of potentials stratified on ellipsoidal surfaces, such as 
those in Eqs. (13.66)-(13.68), can be easily obtained following Exercise 2.9; note that 
the potential in Eq. (13.68) is the power-law analog of Eq. (13.66), and in fact for these 
models the density distribution also presents a characteristic torus-like structure for qg 
near the positivity limit. 

Kuzmin disk (Kuzmin 1956; Toomre 1963) 

GM Mad(z) 


__ R,z) = => 
UR atie P > Rae 


where the student should prove that R;, = ./3a is the half-mass radius and verify the 
potential—density pair of this razor-thin disk obeys the Poisson equation by using the 
Gauss theorem. 


W(R,z) = (13.69) 


As anticipated, several others families of nonspherical models are available. The study of 
the properties of these systems — sometimes built using quite advanced mathematical tech- 
niques — is beyond the task of an introductory book, and the student is strongly encouraged 
to explore the literature. Among these models, we can briefly mention the scale-free models 
(Evans et al. 1997), the flattened isochrone (Evans et al. 1990), the axisymmetric and 
triaxial models built using the homeoidal expansion technique described in Section 2.2.1 
(e.g., see Ciotti and Bertin 2005; Ciotti et al. 2020; Riciputi et al. 2005), the models 
built with variants of the Miyamoto—Nagai technique (see An and Evans 2019; Evans and 
Bowden 2014 and references therein), and a few other illustrative cases described in the 
following sections. 


13.2.3.1 Stickel Models 


A very important family of models (in general triaxial) is obtained by starting from the 
assignment of a potential expressed in ellipsoidal coordinates (A, 4, v) and then proceeding 
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with the evaluation of the density from the Poisson equation (A.157). In this approach, the 
ellipsoidal coordinates are usually defined using a different convention than that adopted in 
Eq. (2.9): now the unitary ellipsoid is 


x2 y? 22 


a + 
a+t BP+t yt 


where a < 6 < y < OandO < -y <v < —f < w < —a < dX. Surfaces of constant 
A are again confocal ellipsoids, but now with the short axis along x and the long axis 
along z. Surfaces of constant jz are hyperboloids of one sheet, and surfaces of constant v 
are hyperboloids ot two sheets. No difficulties are encountered in determining the new rela- 
tions between Cartesian and ellipsoidal coordinates and the expressions of the differential 
operators (see Appendix A.8), and the student is encouraged to work out the corresponding 
formulae. Many interesting models have been constructed in this way (e.g., see de Zeeuw 
1985a; de Zeeuw and Lynden-Bell 1985; de Zeeuw and Pfenniger 1988; de Zeeuw et al. 
1986; Hunter and de Zeeuw 1992; Lynden-Bell 1962b and references therein). Among the 
models that can be generated with this approach, a prominent place (due to their special 
orbital properties) is occupied by the Stdckel separable models, originating from potentials 
of the family 


= 1, (13.70) 


F(A) F(t) F(v) 
A—p(A—v) (w-A\(u—v) (—-A)\V—B)’ 


where F(t) is an arbitrary function. In addition to the separability of the Hamilton—Jacobi 
equation in ellipsoidal coordinates, and so allowing for a complete description of stellar 
orbits in these models (see also Section 9.3.2), several interesting properties have been 
proved to hold for Stackel models, the most famous being due to Kuzmin (see de Zeeuw 
1985b; Kuzmin 1956). Briefly stated, for the potentials in Eq. (13.71): (1) the potential 
along the z-axis (the long axis of ellipsoidal coordinates) is related to the density profile 
along the same axis by a linear second-order ordinary differential equation (ODE; the 
Kuzmin property); (2) for the assigned density profile along the z-axis, the Kuzmin ODE 
can be integrated, and the solution determines uniquely the potential (and so the density) 
over the whole space — the density elsewhere is related to that on the z-axis by the so- 
called Kuzmin formula, and usually the z-axis is the short axis of the resulting density 
distribution; (3) the Kuzmin formula shows that the density is everywhere non-negative if 
this holds along the z-axis (the Kuzmin theorem); and (4) the Kuzmin formula also shows 
that density profiles that fall off along the z-axis faster than z~> lead to stellar systems of 
finite mass, density profiles that fall off less steeply than z~* become spherical at large 
radii, while for a density falloff « z~*+ or steeper the models have finite flattening at large 
radii. In particular, density profiles that fall off faster than z~* lead to density distributions 
that in all other directions fall off as r~*, so they are of quasi-toroidal shape. We will not 
discuss this subject further, but we conclude by recalling that Stackel separable models 
are not the only models for which Kuzmin’s results hold: other nonseparable models 
in ellipsoidal coordinates also share the same properties. Moreover, it has been shown 


PA, Mv) = (13.71) 
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(Ciotti et al. 2012) that the Kuzmin property and a Kuzmin-like formula hold for Stackel 
density—potential pairs in the deep modified Newtonian dynamics (MOND) regime 
(see Exercise 2.34); this is quite surprising, as the differential operator in Eq. (2.123) 
is not the Laplacian but the nonlinear p-Laplacian. For these separable MOND systems, 
the density profile at large radii is in general not spherically symmetric and declines as 
Inr/r? instead of 1/r+. 


13.2.3.2 Complex Shifted Systems 


A method to construct axisymmetric (and triaxial) density—potential pairs from spherical 
density—potential pairs based on the linearity of the Poisson equation has been presented in 
Ciotti and Giampieri (2007) (see also Ciotti and Marinacci 2008; Vogt and Letelier 2009b 
and references therein). The method generalizes to extended mass distributions the concept 
of a holomorphic Coulomb field of a point charge introduced in electromagnetism (e.g., see 
Appell 1887; Lynden-Bell 2004; Newman 1973 and references therein). 

Let us assume that the (nowhere negative) density distribution p(x) satisfies the Poisson 
equation (2.6) with potential ¢ (x). The associated complexified density—potential pair with 
shift a is defined as 


Pc(X) = p(x—ia), (x) = (x — ia), (13.72) 
where i? = —1 is the imaginary unit and a € Si. It is elementary to verify that 
Ado =41Gp. VYae ®, (13.73) 


and so, if we separate p, and ¢ into their real and imaginary components, then we obtain at 
once two real density—potential pairs. Of course, in the gravitational case, the resulting pairs 
are physically acceptable provided they do not change sign as a function of x. In fact, in 
Exercise 13.12 we will prove that 3(,) must necessarily change sign and so cannot be used 
to model a stellar system. Instead, the sign of 3i(o,) depends on the specific parent density— 
potential pair and on the shift vector a, and can be everywhere positive, as illustrated in the 
following example. 

For simplicity, we restrict ourselves to a parent spherically symmetric density—pontential 
pair p(r) and ¢(r). The complexification rule is that the “norm” of the shifted positions 
must be evaluated formally? as 


r?(a) = ||x — iall? =r? — a? — 2i(x,a), a? = |lall’. (13.74) 
Without loss of generality, we assume a = (0,0,a), so that r?(a) =r* —2iaz—a? and 


ee ( fim Tae a?) . Gese (vr? — Diaz — a?) (13.75) 


Note that for a spherically symmetric (p,@) pair, (p.) and 3(p,) can be obtained: (1) 
from evaluation of the Laplace operator applied to Ji(¢,) and 3(@_); (2) by expansion of 


3 Tf one adopts the standard inner product over the complex field, one would obtain ||x — iall? ary ilall. 
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the complexified density in Eq. (13.75); or finally (3) by considering that for spherically 
symmetric systems p = p(@), and so the real and imaginary parts of pc(¢-), can be 
expressed (at least in principle) as functions of the real and imaginary parts of the shifted 
potential. We now consider the Plummer model in Eq. (13.36), and for ease of notation, 
we use densities normalized to M/ ae potentials to GM/rg, and lengths to re (so that the 
vector shift a must also be expressed in units of rc). It follows that UV = —@ = 1/V1+r?, 
and finally 


- mw) +i3s@or, w=+= (13.76) 
BS Ag OY Te Tee 
where from Eq. (13.75) 
co =14+r°@ =de’, 
d = |t.\? = VC — a? +r?)2 + 4a2z2, (15-77) 
l-a4+r? ; 2az 
cos = ————_, sin = —-—. 
Q A Q d 


Note that cosg > 0 everywhere for a < 1, and in the following discussion we restrict 
ourselves to this case. The square root ¢. = V/de™'+¥!/? is made to be a single-valued 
function of (7, z) by cutting the complex plane along the negative real axis (which is never 
touched by 2) and assuming k = 0, so that the model equatorial plane is mapped onto the 
line g = 0. With this choice, the principal determination of WY, reduces to Y when a = 0, 
and simple algebra shows that 


V1 2az 
ioe = = a sin 2 = wa (13.78) 


2 


so that 


Vd+1l—a2+r? az 
ROW) = », CY) = =. 
(We) a (We) = aaa 


From Eq. (13.76), we now obtain the expressions of the (normalized) axisymmetric 


(13.79) 


densities 
3H (W.) 10az* 
NR(pe) = — [sew - a + 59(W.)* |, (13.80) 
2 33 (We) ge Wee oa 
Soe) = —— | SHR(We)* — a + 3(W,)4]. (13.81) 


It can be easily verified (do it!) that the axisymmetric density—potential pair W(p.) — #CVe) 
satisfies the Poisson equation, and that for a < | the shifted density i(p.) is nowhere 
negative. An idea of the shapes of the density—potential pairs produced with the complex 
shift method can be obtained from Ciotti and Giampieri (2007), where the complexification 
was also applied to the isochrone sphere in Eq. (13.43); from Ciotti and Marinacci (2008), 
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where the complex shift was applied in the equatorial plane of the Miyamoto—Nagai disk 
in Eqs. (13.62)-(13.63), obtaining structures with three reflection planes of symmetry; 
or from Vogt and Letelier (2009b), where the shift was applied to the Kuzmin disk in 
Eq. (13.69). For sufficiently large shifts, the shifted densities are characterized by torus- 
like shapes, reminding us of structures such as the Lynden-Bell (1962a) flattened Plummer 
model, the density associated with the Binney logarithmic potential in Eq. (13.67), the 
Evans (1994) power-law models, the Toomre (1982) tori, the Ciotti and Bertin (2005) tori 
obtained from homeoidal expansion, and the Ciotti et al. (2020) galaxy models obtained 
using the same method. 


13.3 The Solution of the Jeans Equations 
13.3.1 Spherical Systems 


After choosing a density distribution, the next step in the construction of a model for a 
stellar system is the solution of the associated Jeans equations (e.g., with respect to the 
velocity dispersion). In this case, their closure (see Chapter 12) can be obtained by assum- 
ing that the (unknown) associated DF depends on certain isolating integrals of motion. For 
example, in spherical models, if we assume f = f(E€, J”) as in Section 12.2.2, the relevant 
second-order Jeans equation to be solved for the density component p is (prove it!) 


dp(r)on(r) | 2B(r) dor(r) 
= , 
dr r dr 
where the definition of 6(r) is given in Eq. (12.34) and Eq. (13.82) is obtained from 
Eq. (10.68) expressed in spherical coordinates as in Eq. (9.52). For a given choice of the 
functions p(r), dr(r), and B(r), the result of the integration is the function 0? (r). Then, 
0, (r) is recovered as 


prop (r) = —p(r) (13.82) 


of (r) = 207 (rn) — BY). (13.83) 


The natural boundary condition for Eq. (13.82) is the vanishing of radial “pressure” at 
(see Chapter 12), i.e., 


p(rio2(r,) = 0, (13.84) 


because, by definition of the truncation radius, no stars can cross 7, (=oo in most cases). 
Because of the linearity of Eq. (13.82), singular points for the solution may appear only at 
singular points of the coefficients (i.e., singular points of B/r and pd@¢y/dr), and in physi- 
cally acceptable models these points may occur only at the center and/or at r,. Remarkably, 
the solution of Eq. (13.82) can be cast in explicit form. In fact, this is a Bernoulli first- 
order inhomogeneous linear ODE (e.g., Ince 1927; Lomen and Mark 1988), and its solution 
(Binney and Mamon 1982) is given by 


d —2 fh Boddy 
WOU) a2 gy (13.85) 
dx 


poles / p(x) 


r 
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Due to its importance, we now show that Eq. (13.85) is the solution of Eq. (13.82) 
with the appropriate boundary condition. The simplest way to do this is to define the new 
function Y(r) = pe (r)o?2 (r) and to solve the resulting equation with the boundary condition 
Y(7,) = 0. From the theory of ODEs, the solution of the resulting ODE is given by Y = 
Yo + Yi, where Yo is the general solution of the associated homogeneous ODE and Yj is 
a particular solution of the inhomogeneous ODE. Let us determine Yo. The integration is 
trivial (do it!), and one obtains 


2" Bly)dy 
Yo(r) = Yo(o)e "79, (13.86) 


where ro # 7 1s for the moment left unspecified. The function Y; is now determined using 
the method of variation of constants; in other words, we set 


Yi(r) = Yo(r) Yo(r). (13.87) 


After substitution of Y; into the inhomogeneous equation, we are left with an ODE for Y 
with solution (prove it!) 


d x Biydy 
Gr) 2 ear 
dx 


: 
Ya) = Yatro) = 7 fp) (13.88) 
Yo(ro) Jiro 

where Y2(rg) is an arbitrary constant. A rearrangement of Yo(r) + Y,(r) shows that, for 
the choice Y2(r9) = —1, a cancellation of Yo(7o) happens, and it is now possible to choose 
ro = r; so that the boundary condition in Eq. (13.84) is satisfied. 

For example, in the case of anisotropic Osipkov—Merritt (OM) spherical systems from 
Eqs. (12.52)-(13.85) 


rt 2 
pews 1 pent ia ae 
: r2 + re - x2 
A 27 A(r) —r?1 
= Oe Se On (13.89) 


where the two functions A(r) and J (7) correspond to the purely radial and to the isotropic 
velocity dispersion profiles, obtained for r, = 0 and rg — ov, respectively. 

More complicated systems can be constructed. For example, in the case of anisotropic 
Cuddeford spherical systems, from Eqs. (12.63) and (13.85) 


G pe rt r2 a+] 
p(r)oe(r) = i | p(x)Mr(x) {14+ 4 dx. (13.90) 
r (r2 a poyert : x2 
As we know from Chapter 12, when a = 0, this reduces to the OM solution, while for 
Ta —> 0 this reduces to the constant-anisotropy case. 

We now derive the projection formula for the second-order velocity dispersion profile 
associated with spherical models with f = f(€, J) and anisotropy profile (r). Because 
in these systems the streaming velocity field vanishes identically, from Chapter 11 a. los = 


= ue ee In particular, for the moment of order 2, in Exercise 13.14 we show that 


N 
D Ios 
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Rt 2, 2 
¥(R)o2(R) = 2 | pos | PONTE >, (13.91) 
R r 


r2 — R2 


where for simplicity we adopted the standard notation oe = ve note how the projection 


integral for isotropic models is similar to that of the density projection. In Exercise 13.15, 
we specialize Eq. (13.91) to the case of OM systems. At the center of the image in the 
projection plane, when the integral in Eq. (13.91) converges, we have 


Rt 
X(0)op (0) = 2 f p(r)o7(r) dr. (13.92) 
0) 


The “observed” o, does not correspond to op(0), but rather to the luminosity (mass in the 
case of constant Y,,) average over the aperture used for the spectrographic observations 


2 


Rap 
aka X(R) o6(R) RAR, (13.93) 
M*(Rap) Jo . 


oe (Rap) = 
where Ms (Rap) is the projected stellar mass inside Rap. Usually, one identifies this quantity 
with og. 

With the aid of this obtained formula, it is an interesting exercise to verify directly 
the projected virial theorem for spherical systems introduced in Chapter 11. In fact, in 
Exercise 13.16 we show by direct integration that the general identity in Eq. (11.44) 
reduces for nonrotating spherical systems and generic £ profiles to 


K 
~~ 
(13.94) 


los los 


Rt rt 
[Ka = KQ =x [ x(RofRyRAR == |” perntoz(r) + oF OW7dr = 
0 0 


13.3.1.1 Power-Law Spherical Models 


As an example of highly idealized spherical systems, for which many dynamical quantities 
can be obtained explicitly yet retaining some qualitative features of more realistic models, 
we consider the untruncated one-component power-law models in Eq. (13.29) with OM 
internal dynamics and with 2 < y < 3. A central BH and an additional external power- 
law density component can be easily added, keeping the level of the presentation simple 
(see Exercise 13.19; see also the spherical limits of the models in Ciotti and Bertin 2005; 
Riciputi et al. 2005). The student is also invited to repeat the treatment of this section for 
OM models and spatially truncated power-law OM models, also adding a BH at their center, 
and to discuss the full range of slopes 0 < y < 3 (see also Exercise 13.21). 
The radial component of the velocity dispersion is obtained from Eq. (13.89) 


Sea an ( s° + 5 ). <— (13.95) 
s ie 2(3 — y)(s* + 2) y—2 y-1 c 


where s = r/rc¢, and the isotropic and purely radial anisotropic cases are recovered for 
Sq — oo and sg = 0, respectively 
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elke a as eee 
re 23-y)(y -) 2 y= (13.96) 
GM, s2-Y Wr) 
cOM= = saa os 5 
where we used VW = —¢ from Eq. (13.30). Therefore 
[ord Y= 1 168% 


[627 hice 7 y—2 
and o; in the radially anisotropic case is larger than that in the case of orbital isotropy, as 


expected. The projected velocity dispersion can be obtained from Eq. (13.91), and in the 
two relevant cases 


2 GM. ne ¥ B(1/2,y — 3/2) 
[op (R) liso = ; 
re 28—y)y — 1) BU/2,¥/2 = 1/2) “sen 
2 GM.” BG/2,y — 3/2) . 
[op (R) rad = , 
re 23 —y)(y — 2) BAL/2,y/2 — 1/2) 
where 7 = R/rc, so that 
2 
Lop (R) Iraa _ 1 (13.99) 


[op(R)iso 207 — 2)" 
Finally, the aperture velocity dispersion defined in Eq. (13.93) converges at the center for 
y < 5/2, and the student is invited to prove that 


= 5 (RRVhvo: [oe (RVaa = —~ 

A comment is in order regarding the behavior of o;(r) for r — 0 in isotropic spherical 
models that can be studied with the aid of power-law models. In fact, it turns out (e.g., 
see Bertin et al. 2002; Binney and Ossipkov 2001) that in the central regions of spherical 
systems with density approximated by p ~ 1/r”, the corresponding isotropic o;,(r) stays 
constant for y = 0, decreases to zero forO < y < 2, is again a positive constant for 
y = 2, and diverges for y > 2. A simple explanation of this nonmonotonic behavior of 
the central o, with the local value of y is that in a self-gravitating system, by imposing 
the density profile, we also impose the gravitational potential (i.e., the gravitational field 
that the system must balance with the “pressure” po? to be at equilibrium). But p is fixed, 
and so the “temperature” ae remains determined by the relative profiles of o and the 
gravitational field. Of course, for all of the values of y, the “pressure” po? is a decreasing 
function of 7, in accordance with the (isotropic) Eq. (13.82), and needless to say, an 
exact parallel of this phenomenon happens in fluid dynamics when imposing a potential 
and a gas density profile and searching for the temperature profile (see Section 10.4.1). 
It should remarked that several of the models encountered in the literature (e.g., the 
y-models or the Sersic R!/” profiles) are characterized by a sizable central depression 
in their op, with quite important implications for the interpretation of observational data 


[o2(R)liso = [op (R)Iraa- (13.100) 
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(e.g., see Bertin et al. 2002; Binney 1980; Ciotti 1991; Ciotti and Lanzoni 1997; Ciotti and 
Pellegrini 1992; Ciotti et al. 1996). 

We conclude this section by reminding the student that they will find in the literature 
numerous examples of more complicated (realistic) multicomponent spherical stellar sys- 
tems where the same calculations can be more or less easily carried out analytically (e.g., 
see the very incomplete list Ciotti 1996, 1999; Ciotti and Ziaee Lorzad 2018; Ciotti et al. 
2009, 2019; Dehnen 1993; Hernquist 1990; Hiotelis 1994; Jaffe 1983; Lokas and Mamon 
2001; Tremaine et al. 1994 and references therein). 


13.3.2 Axisymmetric Systems 


As with the spherical case, after the density distribution is assigned, the next step in the con- 
struction of a model for an axisymmetric stellar system is the integration of the associated 
Jeans equations. If we assume that the underlying DF is of the form f = f(E€, J), then 
from Chapter 12 ae = a, the velocity dispersion ellipsoid is rotationally symmetric in 
the meridional plane (R,z), and from Eq. (10.68) in cylindrical coordinates the stationary 
Jeans equations (prove it!) are obtained from Eq. (9.53) 


dpo? OorT 
Oz ~ are 
; (13.101) 
dpa; PXog dor > a) 2 2 
aR R aR’ Ag =p — 0; = Vy +05 —9;. 


Note that in the isotropic case Ag = ie (i.e., it reduces to the squared streaming velocity 
field), in complete (formal) analogy with the equations of fluid dynamics of (in general 
baroclinic) self-gravitating rotating structures, where the term po? acts as a “pressure” and 
the streaming velocity field vg acts as the fluid velocity (see Section 10.4.1). In stellar 
dynamics, this special but important class of models is known as the family of isotropic 
rotators: the deviation from sphericity of their density distributions can only be due to the 
azimuthal streaming velocity. 

Before proceeding with the integration of Eq. (13.101), a discussion of the boundary 
condition for the velocity dispersion is needed. If the system has an infinite extent, then the 
natural boundary condition is por — 0 for ||x|| — oo. Instead, systems spatially truncated 
by self-gravitation require that everywhere on the system boundary the normal component 
of the “pressure” po, a po; ;ninj vanishes, and in Exercise 13.23 we show that in our 
case this is equivalent to having an = o? = 0, independently of the shape of the truncation 
surface S(R,z) = 0. For assigned p and ¢r7, the general solution of Eq. (13.101) is easily 
obtained by integrating the first equation at fixed R 


z(R) 9 mR) 9 

poz =i LL =i pr ar!, (13.102) 
: 7 Oz! ; or’ 

where the function z;,(R) is given implicitly by S[R,z,(R)] = 0 (and for spatially untrun- 

cated systems z; = oo), and the second expression is useful when the density and potential 
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are expressed in terms of r and R instead of z and R (e.g., see Exercise 13.32), with 
re (R) = R*+ a (R). The quantity A, (and v2) is then recovered from the second equation 
by calculating the radial derivatives, and so we conclude that actually only one integration is 
required to solve the system in Eq. (13.101). However, this approach tends to obscure some 
important properties of the solutions, and in fact an alternative expression for A, can be 
obtained following an approach adopted in fluid dynamics to treat the case of axisymmetric 
rotating fluids (e.g., see Barnabé et al. 2006 and references therein; see also Tassoul 1978). 
An integration by parts, after substitution of Eq. (13.102) into the second equation on 
Eq. (13.101), shows that (prove it!) 


zt(R) 
p&g 0(R. 2) Pe do(R, zt) +/ (3 d¢r dp <) a 
R dR ; OR dz’ dz’ OR 
a 
= p(R,%)——— at re (13.103) 


(e.g., see Hunter 1977; Smet et al. 2015), in perfect analogy with Eq. (10.34) for 
untruncated isotropic rotators. Therefore, for a system with density vanishing on the 
boundary (or untruncated), the boundary term vanishes and only the commutator term 
[e, dr] survives; in Exercises 13.24 and 13.25, some revelant properties of the commutator 
are derived. In particular, in case of spherical symmetry of the density and potential, the 
commutator vanishes and so Ag = 0. Of course, for assigned density and potential, 
the model must certainly be discarded if A, +o? — v2 < 0, a manifestation of phase-space 
inconsistency. 

When A, >0 everywhere," the phenomenological Satoh decomposition is often 
adopted to break the degeneracy between ordered and random motions along the azimuthal 
direction shown by Eq. (13.101), where only the sum v2 = Bag + o5 appears 


Vg =k Ac, |kl <1 (13.104) 
(Satoh 1980), where k is a free parameter. From Eq. (13.104), one then obtains 


4 


o, = v2 0p = (1-W)Ag +07 = (1-k*)u2 +702. (13.105) 


For k = 0, no ordered rotation is present (i.e., OG = v2), and the model’s asphericity is due 


to tangential anisotropy only, while for k = 1, the velocity dispersion tensor is isotropic 
(1.e., a= = oR = a). Finally, note that Satoh systems in the spherical limit are necessarily 
isotropic due to the vanishing of the left-hand side terms in Eq. (13.105). Of course, & can 
also be a function of (R, z), with the only requirement being the non-negativity of oy. As is 
shown in Ciotti and Pellegrini (1996), this limit case is obtained by imposing the maximum 


amount of ordered rotation (i.e., for a7 = O everywhere) when the velocity dispersion 


4 See Barnabé et al. (2006) and Smet et al. (2015) for some general results regarding the positivity of A, based on the use of the 
commutator. 
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ellipsoid reduces to a zero-thickness disk. For a given model, the maximum bound of |k| 
at each point is therefore obtained once the solution of the Jeans equations is known, and it 
is given by 


a: 


ke < kax(R,z) = 1+ 
Ao 
By using this approach, counter-rotation can also be imposed on the Jeans solutions 
(Negri et al. 2014a). Other more complicated phenomenological decompositions can be 
made, such as that proposed by Cappellari (2008, 2020) with og # o-, with the implicit 
assumption of a three-integral DF. Explicit examples of solutions are the two-component 
Miyamoto—Nagai models in Ciotti and Pellegrini (1996) and the fully analytical solution 
for Miyamoto—Nagai models embedded in the Binney logarithmic potential (Smet et al. 
2015). 
Obviously, the Satoh decomposition and its generalization can be applied only provided 
Ag >= O everywhere. However, we can also consider the case of Ag < 0 somewhere 
provided v2 > 0 everywhere. In this case, a possible decomposition of v2 is 


Wp =k /Ag+o2, of =(1—-k*)(Ag +02), |k| <1. (13.107) 


For k = 0, the model in the tangential direction is fully velocity-dispersion supported, 
while it is fully supported by ordered rotation for k = 1 (see Caravita et al. 2021 for 
the multicomponent case). Note that, at variance with the Satoh decomposition, isotropy 
cannot be realized when A, < 0, because by imposing Oo; = o? in the second identity of 


(13.106) 


Eq. (13.107), we would have (prove it!) 0 > A, = Ue which is obviously impossible. 

Once the azimuthal decomposition is fixed, the next step in the model construction is 
the projection of the density and the velocity fields along a chosen /os, fixed by the unit 
vector n. Following Chapter 11, in order to obtain the projected velocity fields at position 
€ | in the projection plane, we must integrate along the whole Jos their projected component 
on n. In the case of a two-integral axisymmetric systems, we can assume without loss of 
generality g = 0,0 = i, and y = 0 in the rotation matrix R in Eq. (11.1), so that 
along the Jos, 


x =é,cosi+ sini, y=&, z= —&, sini + &3cosi, (13.108) 


and n = (sini,0, cosi) from? Eq. (11.2). In practice, the observer system S’ is determined 
by a rotation of # around the axis y = &, and for fixed &, and & in the projection 
plane the projection is an integration along 3. Let us now consider two of the most used 
projected velocity fields in applications, namely the projected streaming velocity field and 
the projected velocity dispersion field in Eqs. (11.9)-(11.11). Before integration along &3, 


5 Recall that n points toward the observer, so that a receding velocity is negative. 
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as dictated by Eq. (11.4), we transform these quantities from cylindrical to Cartesian coor- 
dinates as in Eqs. (12.129) and (12.130) 
Up(X) = —Ug(X) sing sini, 
=—— (13.109) 
03 (x) = 0? (x) + [03 (x) — o?(x)] sin’ g sini. 


Of course, in Eq. (13.109) i is fixed along the /os, while the angle g (not to be confused 
with the angle g = 0 in the rotation matrix!) is given by 


cosy=~, sing=. (13.110) 
R R 


From Chapter 11, the corresponding (mass-weighted) Jos projected fields are obtained by 
changing the coordinates in Eqs. (13.109) to those from Eq. (13.108) and then integrating 
on &3 


i (oe) 
Vis (EL) = acne ie pupdé3, (13.111) 


and from Eq. (11.16) 


ey 1 CO 
OF ios (E 1) = a. p (Up ~~ Vos) 483 = Opies si Volos ~ pos) (13.112) 
where 
a2, (&) = sz / pozdés, Upr(&,) = sz / pup-d&3. (13.113) 
P los xé,) ms Pp los SE) os 


Note that, independently of the /os orientation i, on the axis & (where, by definition, 
sing = 0, and usually corresponding to the short axis of the isophotal ellipse), p(x) = 0 


and o2(x) — oe (x), so that Up), = 0 and oy oe = Ona In addition, the last identity 
holds everywhere when observing the galaxy face on (i = 0) or in the case k = 0. In 
Exercise 13.26, we prove a simple but useful property of second-order projected fields. 
Since the observed velocity dispersion is always measured within a given aperture, we 
finally integrate Or os OVer the isophotes (even though Or os 1 general is not constant over 


isophotes) 
Mpoz = / LE 1 )oP og(E1)d7E 1 - (13.114) 
Aperture 


13.3.2.1 Ferrers Axisymmetric Ellipsoids 


An interesting (albeit quite artificial) family of triaxial ellipsoidal systems is represented 
by the Ferrers ellipsoids (Binney and Tremaine 2008; Chandrasekhar 1969; Ferrers 1877). 
Following Eq. (2.74), they are defined as 


pr(m) = p(0) x (1—m?)"0(1 —m), (13.115) 
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where (0) is the central density and n > 0 is usually an integer number: the n = 0 case 
corresponds to the homogeneous ellipsoid. In the axisymmetric case around the z-axis, 
a, is the semimajor axis of the truncation surface m, = 1 in the equatorial plane, while 
the flattening is given by 0 < gq, = |1—7n < 1. The interesting aspect of these models 
is that many of their properties can be computed analytically, which is quite unusual for 
ellipsoidal systems (e.g., see de Zeeuw and Pfenniger 1988; Lanzoni and Ciotti 2003). The 
mass within m, and the total mass of the model are given by 


3 S) 
M(m) = 2mp (0)azqyq-B (5 + iim?) , M= 2p (O)azqyq-B (5 + ') ; 
(13.116) 


where the Euler beta function is given in Eqs. (A.53)-(A.55); for integer n, the expressions 
in Eq. (13.116) reduce to simple algebraic form. Notice that the last identity in Eq. (13.116) 
can be used to express (0) in terms of pp = M/(4xa?), and so to recast Eq. (13.115) in 
the form of Eq. (2.28), and finally to determine p(m). From Eqs. (13.54) and (13.58), the 
surface density and the projected mass profiles are given by (prove it!) 


pe) = PO (Sant 1)a—eytt, My =m [1-0-4], (13.117) 


so that from Eqs. (13.57) and (13.59) it follows that 
fg =V1—2-VO8), (Re) = 01 le</Gygz0d1. (13.118) 
Note that Eq. (13.115) in the axisymmetric case with n integer can be rewritten form < 1 as 
toe (1)! (n\ (i 
p(m) = po Y) Yd) pnif (Gz) RILEY, pif (Gz) = a (") (;). Gs.115) 
i=0 j=0 qz 


where R = R /a\, 2 = Z/a. For this distribution, the potential in Eq. (2.75) can be easily 
calculated, because AW reduces to a finite sum, and after some algebra (prove it!) 


n+l i 
$2) = -nGnOe8 FY (a. dR"PE 
i=0 j=0 
q.(—1)! n+1\ (i  f£..2..3 ; 
j\42) = F | 
oii) =a peta i Jer att att ge): 


(13.120) 


where 2F is the standard hypergeometric function in Eq. (A.69). Note that 2F) (a,b, 
c;0) = 0, and that for i and j integers 2F; in Eq. (13.120) simplifies to a combination 
of elementary functions. Moreover, note that in the special case of the constant-density 
ellipsoid, from Eq. (2.67) we have ¢o909 = aigzwo(0) and from Eq. (3.11) we have 
doio(gz) = —w, = —w? and ¢011 (gz) = —w3, in accordance with Eqs. (3.62) and (3.63); 
for the quasi-spherical limit, see Eq. (3.13). The components of the gravitational energy 
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tensor W;; can be easily computed by using Eq. (10.64) specialized to the truncated case 
discussed in Exercise 2.8. In the axisymmetric case, W22 = W)1 and 


29. 1 
Wu = —Gp(0)2a3 2 2"! _B (5.20 i, 3), 


lom+2 \2 
W33 =-G oa "3 B 1 an +3 (13.121) 
AE el Gey I ND 


Finally, it is obvious that the Jeans equations (13.101) can be easily integrated, and in 
particular in Eq. (13.102) we have z, = q-, /a* — R?. In fact, thanks to the finite expansions 


of p and ¢, the quantities po? and pA, in Egs. (13.102) and (13.103) can also be obtained 
as polynomials. All of the steps of the computations, up to the projection of the resulting 
velocity dispersion fields and integration over isophotal ellipses in the projection plane, 
can be found in Lanzoni and Ciotti (2003). The resulting formulae are quite cumbersome 
and not illuminating, and so we do not report them here: however, the special case of the 
constant-density ellipsoid is remarkably simple (see Exercise 13.27). 


13.3.2.2, Homeoidally Expanded Axisymmetric Two-Tntegral Systems 


A quite general family of axisymmetric models, whose two-integral Jeans equations are 
amenable to an interesting formal solution, are those derived from homeoidal expansion 
described in Chapter 2. We recall that the obtained density—potential pairs can be inter- 
preted in two different ways: as the limit of ellipsoidal systems for small flattenings or 
as exact pairs with finite (and not necessarily small) flattenings. In the latter case, when 
evaluating products of the expanded functions (as required by the Jeans equations), all of 
the flattening terms up to the second order inclusive must be retained, while in the first case 
only linear terms in the flattenings matter. Of course, in both cases no special difficulties 
are encountered, but here for simplicity we restrict ourselves to the first case. Worked-out 
examples of solutions of the Jeans equations (also in the case of multicomponent systems 
with a central BH) can be found in Ciotti and Bertin (2005), Ciotti et al. (2020), and Riciputi 
et al. (2005). We first consider the solution of the vertical Jeans equation (13.102), and 
we recognize that the density—potential pair in Eqs. (2.29)-(2.31), when specialized to 
the oblate axisymmetric case (€ = Q), is not in the most useful form to be integrated, 
as it explicitly contains the variable z. However, with the substitution 2 = r* — R?, 
we can rewrite the dimensionless density—potential pair associated with p = pnp(m) and 


g = —W,o(x) as 


p(m) 


Cay ~ Pats) + mils) + nR°pr(s), (x) ~ bo(s) + ndi(s) + nR*o2(s), 


(13.122) 


where the meaning of the six spherical functions is obvious from Eqs. (2.29)-(2.31). 
Equation (13.102) is evaluated by expanding the integrand and retaining the terms up to 
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the linear order in the flattening and changing the variable from z’ to the spherical radius r’ 
at fixed as R, obtaining 


po: = / poy ds! ~ CUP LACS) + nB(s) + nC) (13.123) 
S 1 
where the three radial integrals are 
A(s) = — 1 ols’) (s") as, 
B(s) = — / [Bo(s)G{ (s") + Ar(s)@G(s)] ds’, (13.124) 


C(s) = — / [Pols )b5(s") + Ars’) Gh(s’)] ds’. 


Equation (13.103) is solved similarly by using the rule established in Eq. (13.172), and 
after integration by parts one obtains the remarkable expression 


GMp. ~ ae de 
pds = nk? [ CO) — Ao(s)42(0)]. (13.125) 


Note how the quantity in Eq. (13.125) vanishes in case of a spherical model, as is expected 
on general grounds for two-integral systems, and how the function C(s) appears the same 
in the vertical velocity dispersion. Finally, the student is invited to show that the self- 
gravitational energy can be written as 


2 


GM 
U=-— 
a 


(Uo + nU1), (13.126) 


and to write Up and Uj in terms of the functions appearing in the homeoidally expanded 
density and potential (see Ciotti et al. 2020). 

A few comments are in order before concluding this section. First, perfectly analogous 
formulae hold in the prolate case, obtained from Eqs. (2.29)-(2.31) with « = 7, and so 
R? = y*+2?, and x is the symmetry axis. Of course, now the transformation z7 = r* — R? 
is not required, and (1—1)? appears at the denominator of #(m), so that the radial functions 
in the expanded density—potential pair are not the same as in Eq. (13.122), but they can 
be worked out (do it!) without much difficulty. Therefore, the structure of Eqs. (13.123) 
and (13.126) is unchanged, even though the functions B and C change. Second, it is not 
difficult to prove (do it!) that in general A > 0, and that if 6’ < 0, then C > 0 in the 
oblate case, while C < 0 in the prolate case; this means that we can have a (low-flattening) 
self-gravitating oblate isotropic rotator, but this is impossible in the prolate case. As we 
will see in Exercise 13.29, this result actually holds even for finite flattenings. 

We conclude this section by reminding the student that the solutions of the Jeans equa- 
tions can also be obtained for genuinely triaxial systems or for axisymmetric models with 
three integrals of motion, but these technical problems are well beyond the level of this 
introductory book (e.g., see de Zeeuw et al. 1996; Dejonghe and de Zeeuw 1988; Evans 
and Lynden-Bell 1989; van de Ven et al. 2003 and references therein). 
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13.4 The Fundamental Plane of Elliptical Galaxies 
and the Virial Theorem 


A very specific but far-reaching application of the Jeans-based modeling techniques is the 
physical interpretation of a remarkable empirical relation obeyed by stellar spheroids (i.e., 
early-type galaxies and bulges): the so-called fundamental plane (FP; e.g., see Djorgovski 
and Davis 1987; Dressler et al. 1987; see also Cappellari 2016; Cimatti et al. 2019). We 
cannot describe all of the observational and theoretical issues surrounding the problem, as 
an immense literature is available, so we simply focus on the most elementary dynamical 
aspects (for a more extended overview, see, among others, Ciotti 2009; D’Onofrio et al. 
2016 and references therein). In particular, we will attempt to clarify some of the confusion 
that unfortunately is present in the literature and that can be quite dangerous for the student 
(i.e., the idea that, after all, the FP is nothing more than the virial theorem in disguise, and 
so that it is possible to “deduce” the existence of the FP just from the virial theorem). 

Stellar spheroids are characterized by three main observables: the circularized effective 
radius (R).~ defined in Eq. (13.60), the luminosity-weighted projected velocity dispersion 
o~. measured within some prescribed aperture (e.g., Ra = (R)e/8) defined in Eq. (13.93), 
and the mean effective surface brightness (J). = L/(2a7(R i) where L is the total luminos- 
ity of the galaxy in the adopted band. Therefore, each galaxy is represented by a point in the 
three-dimensional parameter space ((J)e, (R)e,0.) or (L, (R)e, 0). Statistical studies reveal 
that the parameter space is not populated uniformly; rather, the data points are confined to 
the vicinity of a narrow logarithmic plane, the FP 


log(R)e = alogo, + Blog(Ie+y. (13.127) 


The coefficients a, 6, and y depend slightly on the considered photometric band. For 
example (see Jorgensen et al. 1996), by measuring (R)- in kpc, o. in km s_, and (J)¢ 
in Le/pe’, reported values in the Gunn r band for Coma Cluster galaxies (with Ho = 
50 km s~! Mpc7!) are a ~ 1.24, 8 ~ —0.82, and y ~ 0.182. Moreover, the FP presents 
a strikingly small and nearly constant scatter; (R)e presents a scatter around the best-fit FP 
at fixed o, and (/)_ in the order of 15% or less. As is well known, the famous scaling laws 
of Faber and Jackson (1976) and Kormendy (1977) and D, — o (Dressler et al. 1987) are 
the natural consequences of the distribution of data points on the FP and of its projections 
on the three coordinate planes of the parameter space. 

No definite/universally accepted interpretation of the FP and its origin has been found 
yet, while it is sometimes stated that — after all — the FP is “nothing more” that the virial 
theorem. As we will see, if interpreted literally this is a wrong statement, akin to claiming 
that the main sequence in the Hertzsprung—Russell diagram is nothing more than the hydro- 
static equilibrium of gas spheres! In fact, we will see that even though the virial theorem is 
a necessary condition for the existence of the FP, it is by no means sufficient. We remark 
that, for the following discussion, the “exact” values of the FP coefficients w, 6, and y are 
irrelevant — what is fundamental is that they are robustly measured and that the data points 
have a small scatter around the FP. 
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The first consideration comes from the fact that stellar spheroids are much older than 
their characteristic dynamical times Payn ~ 2Re/00 © 108 yrs, so that these systems, in the 
absence of strong perturbations, are essentially at equilibrium (Lynden-Bell 1967; see also 
Section 6.2); this conclusion is reinforced by the observed large-scale regularity of their 
density profiles. It is then plausible to assume that the vast majority of stellar spheroids are 
virialized systems. Now, for a galaxy of total stellar mass M,, embedded in a dark matter 
halo of total mass My, = RM,, the virial theorem can be written in full generality (see 
Chapters 6 and 10) as 


2Ky _ |W a |Wen| _ GM, 


Wax| + R|Wenl), 13.128 
Ms M, : x (Weel + R| Wend) ( ) 


2 
ov = 


where oy and K,, are the (three-dimensional) virial velocity dispersion and total kinetic 
energy of the stellar component, respectively, Y,, = M,,/L is the stellar mass-to-light ratio 
in the specific band used to measure L and (R)e, and finally 7, is a scale length of the stellar 
component. From Eq. (10.52) 


— / “pal. Voe)dx, Wan = — / pal, Von) dx, (13.129) 
wR 3 


x 


where ¢,, and ¢p are the gravitational potentials generated by stars and dark matter, respec- 
tively. As an example, in Exercise 13.17 we compute these functions for the two-component 
Plummer model. 

We now relate the previous identity with observational quantities. From Chapter 11, we 
know that (R)~ and o2 are related, respectively, to r,, and oy by dimensionless coefficients 
that depend on projection effects combined with the intrinsic shape of the stellar distri- 
bution, on the dark matter amount and distribution, and on the specific internal dynamics 
(rotation, orbital anisotropy, etc.). For each galaxy, we can then write 


rx = Cs(structure, projection) x (R)e, 


; (13.130) 


o? 


oe = Cp(structure, anisotropy, projection) x o 


where in the presence of color and/or metallicity gradients in the galaxy, the coefficients 
will also depend on the adopted wavelength of observations. It follows that Eq. (13.128) 
can be rewritten for each galaxy as 


K CsC 
= (R).e2, Ky= 
GY, | Wye + R| Wyn! 


Equations (13.127) and (13.131) are all that we need. In fact, their combination (they 
must be both true!) implies that in real spheroids, no matter how complex their structure is, 
Y,./Ky is a well-defined function of any two of the three quanitities (L, Re,o,); in fact, by 
eliminating (for example) o, from the two identities, one concludes that on the FP 


(13.131) 


= ea ey eae, (13.132) 
Vv 
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The dependence of the ratio Y,./Ky on galaxy properties is commonly referred to as the 
“FP tilt”? The physical content of Eq. (13.132) is truly remarkable: all stellar systems 
(real or hypothetical) described by Eq. (13.131) are virialized, but only those for which 
Y./Ky scales according to Eq. (13.132) correspond to real galaxies. In other words, the 
very existence of the (thin) FP dictates that the structural/dynamical (Ky) and stellar 
population (Y..) properties of stellar spheroids are strictly connected, possibly as a con- 
sequence of their formation process; understanding the origin of the FP tilt is thus of the 
utmost importance for the understanding of galaxy formation. Of course, this argument 
conclusively shows that the virial theorem by itself does not imply any FP. In fact, different 
galaxies, all virialized, in principle could have very different Ky and Y., and so be scattered 
everywhere in the (Re, L,o.) space; perhaps the simplest concrete example is given by 
spiral galaxies, which are certainly virialized, but they do not define any FP. Note that by 
using values for aw and f such as those reported above, there is only a mild dependence of 
Y./Ky on (R)e, and in the hypothetical case of 8 = —1 and a = 2, from Eq. (13.132) we 
would deduce Y,./Ky = const. (i.e., that stellar spheroids would follow strong homology). 
The fact that in nature the values of a and 6 almost correspond to strong homology led 
sometimes to the erroneous identification of the FP with the virial theorem, while it should 
be clear by now that the correct conclusion is that stellar spheroids are almost homologous 
systems, and that a remarkable fine-tuning between Y,, and Ky is required to produce the 
FP tilt and yet preserve its surprising tightness. Note that this conclusion would be the same 
even if a and 6 were found to be very different from their current values, as far as the FP 
would be thin. 

Therefore, the central problem posed by the existence of the FP is to determine what 
are the physical ingredients responsible for the tilt. A very large body of literature has been 
produced in this area, and here we necessarily restrict ourselves to some selected examples, 
without any pretense of completeness. Perhaps the simplest approach to extracting infor- 
mation from Eq. (13.132) is to focus on the variation of a single galaxy property among the 
plethora in principle appearing in the quantity Y,./Ky, while fixing all of the others to some 
prescribed value. For instance, one can explore the possibility that a systematic variation of 
Y, with L is at the origin of the FP tilt, while considering the galaxies as structurally and 
dynamically strongly homologous systems with the same Ky. In this case, it is elementary 
to conclude that the FP would be due to a systematic increase of Y,, with the galaxy L, and 
the study then moves to the field of stellar evolution and stellar populations (e.g., see Bender 
et al. 1992; Prugniel and Simien 1996; Renzini and Ciotti 1993; van Albada et al. 1995). 

The other extreme possibility is to assume a constant Y, among all galaxies and so 
to require that the galaxy density profiles, dark matter content and distribution, stellar 
orbital distribution, and so on vary systematically with (R).~ and L, so that Ky obeys 
Eq. (13.132). This possibility is called weak homology. The modeling of weak homology is 
a natural field for the use of Jeans modeling: the usual approach is to build two-component 
models (stars plus dark matter), solve the Jeans equations, project them and obtain the Ky 
coefficient, and change the structural/dynamical parameters until Eq. (13.132) is satisfied 
(e.g., see Bertin et al. 2002; Caon et al. 1993; Ciotti and Pellegrini 1992; Ciotti et al. 1996; 
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Graham 1998; Graham and Colless 1997; Hjorth and Madsen 1995; Prugniel and Simien 
1997; Renzini and Ciotti 1993). In general, one sees that the relative amount/distribution 
of dark matter versus the stellar component should increase along the FP or (for example) 
have a Sersic index m of the stellar distribution that increases for increasing L, in broad 
agreement with observations. Another possibility for systematically changing (decreas- 
ing) Ky at increasing L is that more and more luminous galaxies are more and more 
radially anisotropic due to obvious projection effects (e.g., see Eq. (13.162)), with the 
consequent increase of o, at fixed ov. However, it is difficult to reconcile there being a 
major role of this effect in explaining the FP tilt with the expected onset of the radial orbit 
instability (see Section 6.2) in the resulting anisotropic models (Ciotti and Lanzoni 1997; 
Nipoti et al. 2002). 

Three final points are worth mentioning at the end of this brief discussion. The first is 
that galaxies are not spherical, and so we must understand what happens to a represen- 
tative point of a galaxy in the parameter space (L,o.,(R)e) as a function of its relative 
orientation with the observer, because both o, and (R)~ change while L stays constant 
(see Chapter 11). By using analytical and numerical models (e.g., see Bertin et al. 2002; 
Gonzalez-Garcia and van Albada 2003; Jorgensen et al. 1993; Lanzoni and Ciotti 2003; 
Nipoti et al. 2002, 2003; Saglia et al. 1993; van Albada et al. 1995), it is found that projec- 
tion effects move models not exactly parallel to the edge-on FP by an amount that can be 
comparable with the observed FP thickness, and so the intrinsic scatter of the FP is smaller 
than the observed one, making the problem of fine-tuning even more interesting. The 
second point relates to the effects of galaxy merging, which, of course, if they occur, must 
preserve the FP tilt and thickness, and this is not obvious at all (see Section 6.2.2 and refer- 
ences therein, in particular Ciotti et al. 2007). Third, a very interesting problem, which we 
only mention briefly here, is posed by the comparison of the scaling laws followed by the 
dark matter halos predicted by cosmological simulations with the observed FP of galaxies 
(Lanzoni et al. 2004). 


Exercises 


13.1 By changing the order of integration, verify the Abel inversion theorem in Eqs. 
(13.1) and (13.2) by using the Euler integral 


y dt T 
i =Bwd=oje—"—. Cee, (13,133) 
x (—x)!-*(y —-1)% sin(z a) 


Note how the value of this remarkable integral (a complete beta function in dis- 
guised form) is independent of x and y. Hint: Use the second identity of Eq. (A.46) 
and 
Eq. (A.54). 

13.2 Deduce the deprojection formula in Eq. (13.5) for transparent spherical systems. 
Hints: In Eq. (13.4), define x = r?, X = R?, X; = R?, U(X) = D(VX), u(x) = 
p(./x). Show that U(X) = na u(x)(x — X)~!/?dx and invert it by using the 
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Abel formula. After restoring the original variables and conducting an integration 
by parts and a successive differentiation with respect to r, complete the proof. 

For the Plummer sphere in Eq. (13.36), show that the projected density and the 
spatial half-mass radius are given by 


Mr; 1421/3 
=(R) = — aa Th = Tes 
V3 


m(R2 + r2)2’ 
in agreement with Eq. (13.37) for B-models evaluated for 6 = 5. Moreover, show 
that for the family of B-models 


(13.134) 


—3 3 
M = 2mp(0)r2B (=. 5): B > 3, (13.135) 
and that, under the assumption of a constant mass-to-light ratio Y,, the core radius 
(for 6 > 1) and the effective radius (for 6 > 3) are given by 


2 2 
Ro =V2F1 —1re, Re =V 273-14. (13.136) 


Prove Eqs. (13.15)—(13.17). Hints: The first two identities in Eq. (13.15) are imme- 
diate consequences of the definition of S, while for the last identity use Eq. (13.4), 
invert the order of integration to obtain 


S(x) [ow )d a = (13.137) 
x)= r)ar : : 
x x /(R2 — x2)(r2 — R?) 


and finally evaluate the inner integral to 27 from Eq. (13.133). The identities in 
Eq. (13.16) are now elementary. Finally, Eq. (13.17) is proved by inserting into 
the last identity of Eq. (13.12) the two expressions in Eq. (13.16) and performing 
integration by parts with the conditions S(7,) = 0 andr S(r) > 0 forr — 0. 

Prove the identities in Eq. (13.18). Hints: For the first identity, use Eqs. (13.11) 
and (13.16). The second identity is obtained from the first considering the limit for 
r — 0 under the assumption that $(0) = 2 i I(R)dR is finite and then using the 
mean value theorem for integrals (or de 1’ H6pital’s rule). 


Compute the self-gravitational energy for y-models in Eq. (13.38) and show that 


U=W GM" ! 0< 5/2 (13.138) 
— — - < " is 
7; 16-2 ° >" 


Explain with physical arguments what happens for y > 5/2. Hints: Use the last 
identity of Eq. (13.12) and change the variable to t = r/(1 +r) in the resulting 
Tchebishev binomial integral. 

By using Eq. (13.12), compute the self-gravitational energy for the King (1972) 
model in Eq. (13.34), for the NFW model in Eq. (13.40), for the Hénon (1959) 
isochrone in Eq. (13.43), and finally for the spherical limit of the perfect ellipsoid 
in Eq. (2.93), and show that 
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M2 = M?’*(3n—8) M? 
Qf (c)2Kre’ 12r- 


’ 


U=W=-G [an p(0)?r?, i; (13.139) 


20Te 


respectively. 

13.8 Here, we prove an interesting identity concerning the self-gravitational energy U of 
a spherical system expressed in terms of the projected density profile &(R). First, 
show that the triple integral in Eq. (13.17) can be rewritten as 


m/4 
U= -166 | K(tan g) f(g) sing dg, (13.140) 
0 


where K is the complete elliptic integral of the first kind in Eq. (A.65) and 
[o,@) 
f(y) =| X(R cos v)=X(R sing) R7dR. (13.141) 
0 


Then, show that in the case of the Sersic profile in Eq. (13.19), the function f 
evaluates to 
© (0)? R3 ml 3m) 
[b(m) Q(g) >” 
so that from x = tan g for the Sersic a 
3 16mI (3m) K(x)x 
Re b(m 3m o ( + x1/m)3m dx 


Hint: For a proof, see Baes and Ciotti (2019b); see also Ciotti (2019). 
13.9 Show that for an axisymmetric ellipsoidal system with a; = ay and g, = a3/a, 
independently of g 


f@) = , Q(v) = ose)!" + (sing), (13.142) 


U =—-G>(0)? (13.143) 


sin? 0 1 : 
cos” e+ 0 l=. sin } cos 3 


: zZ 


0 1 0 , (13.144) 


1 
(1- +) sin eos 9 0 sin? 0 + 


Va 


i) 
ay 


1 ° 1 
a? = (sie pee “). (Ef, €,) = 2 (1 - =) sin} cos#, (13.145) 
a qd: 


and that the matrix H is diagonal with 
1 
H = — | cos? 0 + q?2 sin? 3 
ay 
0 1 


, @=(HE,,é,). (13.146) 


Finally, prove that in the spherical case, a = 1/a,, H and E are independent of v7, 
and £ = R/a,, where R is the radius in the projection plane. 
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13.10 With this exercise, we compute the self-gravitational energy of some of the 


13.11 


most famous axisymmetric razor-thin disks of finite mass. From the identities 
in Exercise 10.13, prove that for the Kuzmin disk in Eq. (13.69), for the exponential 
disk in Eq. (2.110), for the truncated constant-density disk in Eq. (5.47), for the 
Maclaurin disk in Eq. (5.49), for the (truncated) Mestel disk in Eq. (2.115), and 
finally for the finite Mestel disk in Eq. (5.51), one has 


1 37 8 3x 4G a 
4Rq’ 32Raq° 3 R, 10R,’ wR,’ 2R; ; 


U = —GM? x (13.147) 
respectively, where G ~ 0.916 is the Catalan constant. For the Kuzmin disk, repeat 
the integration by directly using the density—potential pair. Evaluate numerically the 
coefficients above and relate their behavior with the disk concentration. Hints: In the 
following, we use Gradshteyn et al. (2007). For the Kuzmin disk from Gradshteyn 
et al.’s eq. (6.554.4) 

M a 


Lk) = —e, A=ka, (13.148) 
20 


and the result is now elementary. For the exponential disk, use Eq. (2.114), so that 
the resulting integral is elementary. For the truncated constant-density disk and for 
the Maclaurin disk, use Eqs. (5.53) and (5.54), respectively, and calculate U from 
Gradshteyn et al.’s eq. (6.575.2). The (truncated) Mestel disk case is most easily 
solved by evaluation of U in terms of v, in Eq. (5.45) and from Gradshteyn et al.’s 
eq. (6.141.1). The case of the finite Mestel disk is more difficult: use Eq. (5.55) and 
integrate twice by parts, with the appropriate limits at 0 and oo and with the aid of 
Gradshteyn et al.’s eqs. (3.741.3) and (6.254.2), obtaining 


© Si? (A) © gin? A 
di=2 dA=n. (13.149) 
ae eo & 


Evaluate the self-gravitational energy of the Miyamoto—Nagai models in Eqs. 
(13.62) and (13.63) and show that 


Us) GM?[ 1-2s* «a F(s) a (13.150) 
s)= eS; : 
8b |s(l—s2) 252 s2(1 — 37) b 
arccos(s) i2 1 
— ——— > — Ss = ’ 
/| — 52 
F@) = . (13.151) 
arccosh(s) 
———, s>l, 
gs? — 1 
with F(1) = 1. Then prove that U(1) = —(GM?/b)(1/3 — 1/16), U(O) = 


—(GM?/b)3x /32 (the Plummer sphere), and U(oo) = —GM?/(4a) (the Kuzmin 
disk), in agreement with previous results. Hints: Normalize all lengths to b. Integrate 
first over R, changing the variable to R* = x. The resulting expression is rational in 


13.12 


13.13 


13.14 


13.15 
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the funtion 1 + z?. Change the variable to z = sinh y and obtain a rational func- 
tion in cosh y. Perform a standard Hermite partial fraction decomposition, transform 
cosh y in terms of the exponential, change the variable to e” = ¢, and integrate the 
rational expressions; see also Ciotti and Pellegrini (1996). 

With a change of integration variables, prove that the gravitational energy of the 
complexified system coincides with the gravitational energy U of the parent density, 
ees, 


1 1 
U.= al pebed?x = >| pod°x =U, (13.152) 
2 J93 2 Jn3 


and deduce that 3(U,) = 0. Then prove the identity 
G / NL Pc(X) SL Pc (X’)] 
ne 


- ord = S(Us), (13.153) 
IIx — x"|| 


so that the integral vanishes. Along the same lines of reasoning, show that the total 
mass of the complexified distribution M, = f pcd 3x coincides with the total (real) 
mass of the seed density distribution M = a pd>x, so that 


/ 3(p.)d?x = 0. (13.154) 
ns 


What happens to Eq. (13.82) in the case of purely tangential orbits when o, = 
0 at all radii? Show that in this case ae (7) = ue(r), the circular velocity of the 
model. Hint: Consider the behavior of the anisotropy parameter f in Eq. (12.34) for 
a vanishing radial velocity dispersion. 

By direct integration, prove Eq. (13.91). Hints: Dv... =f p(n, v)2dé3. Given the 
spherical symmetry, we can assume without loss of generality | = x, & = y, and 
&, = z. As a consequence, r = ||é||, R* = x? + y*, and n = (0,0,1), and so 
ve... of pv2dz = a p(r)v2(r)(r — R?)~'/?dr. Expressing v, in terms of 
the velocity components in spherical coordinates and recalling that v-vy = 0, one 
obtains v2 = (vu; cos 9 — vp sind)? = v2 cos” 0 + v5 sin? 3 = v(1 — Bsin’ d), 
and from the identity sin’ %* = R?/r? the result is proved. 

In numerical studies, the evaluation of Eq. (13.91) usually requires the evaluation 
of a double integral. By using integration by parts, show that in the case of OM 
anisotropy (with the exception of the special case R = 0 and rg = QO), the integral 
can in fact be reduced to a single integration 


R 2 
=(R)op(R) = c| K(r)p(r)Mz(r) (1 4 “) dr, (13.155) 
R r 


where 


rw 2r? + R? : r2 — R? R?4/r2 — R? 13.156) 
= S59 na al Clan 7 7 
OF + RSP BPR Us +r GP +R) 


Discuss the case for R = 0 andr, = 0. 
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By direct integration, prove Eq. (13.94). Hints: Use Eq. (13.91) and invert the order 
of integration, so that 


t= p/P 
0 vr R 


Qn 9 9 
= = f p(r)o,(r)*[3 — 2B(r)]r“dr. (13.157) 


R, rt 
x | X(R)op(R) = 2x f p(r)o2 (r)rdr RdR 
0 0 


Finally, use Eq. (12.34). 


Consider a stellar Plummer model of mass M,. and scale length r,. in Eq. (13.36). 
First show that 


2 
GME 3m oxy) _ YO 


U=W= — o(r)= 
32 6 


: (13.158) 


where the second identity holds for the isotropic case. Then embed the stellar model 
in a dark matter Plummer sphere of total mass My = RM, and scale length ry = 
&r,, and from Eqs. (13.13) and (13.14) show that 


GM2ZR (&7 + 1I)E(V/1 — &2) — 2é?K(/1 — 5) 


Uext = Gp? (13.159) 
GM2ZR &2(3&2 + 5)K(\/1 — &2) — (JE? + I)E(,/1 — £2 
Woe = —* &°(3&" + 5)K( i + 1IE( e) (13.160) 


where K and E are the complete elliptic integrals of the first and second kinds in 
Eq. (A.65). Finally, solve the Jeans equations in the OM case by using Eq. (13.89) 
and show that o, can be obtained in explicit algebraic form, and that at the center 


OMe 2R(E + 3) aE 4R }} 
6r,. &(1+ &)3 3(€ + 1)3 


where sa = ra/rx (see also Renzini and Ciotti 1993). Discuss as a function of € the 
limits of a dominant dark matter halo and the purely isotropic and radial cases. 


o, (0) = (13.161) 


The projection of the OM velocity dispersion in the two-component Plummer model 
of Exercise 13.17 in general cannot be expressed in terms of elementary functions. 
However, in two important cases the projected central velocity dispersion can be 
obtained quite easily. Show that in the purely stellar OM case 


GM, (353 + 652 + 55, + 4) 


2 
0) = 
ae 645,(s, + 1)2 


(13.162) 


What happens in the isotropic limit? What happens at decreasing sy? Explain the 
behavior by considering projection effects. Compare this with Eq. (13.161) with 
R = 0 and explain why o,(0) < o;(0). Another simple case is the purely isotropic 
two-component model with dark matter. Show that in this case 
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GM, E 4 pit E-)VE(/ 1 — 7) — 6&7 +3)KG/1 — a 


2 _ 
op 0) = 64 2&2 — 13 


lx 
(13.163) 


Draw a plot as a function of &. 

Consider a supermassive BH of mass Mgy at the center of a power-law galaxy as in 
Eq. (13.29). Solve the OM Jeans equations and show that for y > 1 the contribution 
of the BH gravitational field to the velocity dispersion profile is 


GM 1 - 2 
o2(r) = 8 Bt RN (13.164) 
re s(s*+s2)\y-1 yl Tis 


Moreover, from Eq. (13.91), show that the associated isotropic and completely 
radially anisotropic projected velocity dispersion profiles are 


GM B(1/2,y/2 
[a2 (R)liso = (y+ SB 1/2)’ 

iG a ° (13.165) 
[o2(R) kaa = —— aul 


ren (vy — 1)B(1/2,y/2 — 1/2)" 
Finally, from Eq. (13.93), show that a finite-aperture velocity dispersion in the 
presence of a central BH requires y < 2, and in this case 


3- 3- 
[or Ryhso = 5— oR RVhvo» (oe (Rhus = 5 — Lop RVhaa- (13.166) 


2-y 
Consider the SIS in Eqs. (2.63) and (13.29) with a constant circular velocity ve. 
Solve the Jeans equations in the isotropic case and show that 


2 
ee a aah (13.167) 


(see also Exercise 12.6). Then, by using the considerations after Eq. (12.63), inte- 
grate Eq. (13.90) for rg — ov, thus obtaining the constant-anisotropy solution for 
B = —a, with the isotropic case corresponding to a = 0. Show that oO, (r) — 
0.5u2/(a + 1), while o2 = v2 independently of a. 

With these (quite artificial) models, we elaborate on the peculiar behavior of oy and 
ry for some model with infinite total mass (see Footnote 6 in Chapter 6). Consider 
the families of spherical density distributions 


AO(r — Aew"/" 
pir) = aa 


a (13.168) 


Evaluate their total masses, solve the OM Jeans equations (also in the presence of 
a central BH), and finally compute the self-gravitational energy U = W. Deter- 
mine the limit on y for the convergence of U (see also Exercise 13.6). Discuss the 
behavior of oy and ry as a function of y for 7, — oo. What happens for y = 2? 
Is the result consistent with the expectations regarding the SIS that can be obtained 


276 


13.22 


13.23 


13.24 


13.25 


Modeling Techniques 2: Moments Approach 


from Eq. (13.167)? Hints: Consider Exercise 10.17 and discuss the behavior of the 
virial theorem surface pressure term for an SIS with a spherical control volume Q 
of radius r, for r > oo. 

This exercise addresses a question that emerges quite often in discussions with 
students (and not only them!). From Newton’s second theorem and Eq. (13.82), 
it follows that only the mass inside the radius r determines the variation of velocity 
dispersion at r. However, from Eq. (13.85), o, appears to be determined by an 
integral extending from r to 7. Explain why this is the case. Hints: For simplicity, 
consider the isotropic case and consider a stellar system with density profile p(r) 
in the gravitational field of a shell of mass M, and radius rs, ignoring the self- 
gravitational field of the system. From the solution of the Jeans equation, show 
that for r > rs the radial “pressure” p- = po; is the same as that for a central 
mass M,, while for r < r, the pressure is constant p;(r) = p;(rs), even if there 
is no gravitational field inside the shell. Discuss the analogous case of a gas in 
equilibrium in the field of the shell and conclude that you need a pressure inside the 
shell to support the gas outside the shell. 

Show that the natural boundary condition on o, for the vertical Jeans equation 
(13.102) of an axisymmetric stellar system with f = f(€,J,) and spatially trun- 
cated on the surface S(R,z) = 0 is og = o, = 0 0n S. Hint: At any point on the 
boundary, the unit vector is given by 


(Sx, Sy,Sz) (Sr cos g, Sr sing, Sz) 


N= (nx,Ny,Nz) = _ : (13.169) 
2: 2. 
dae a af Spe Se 
where S; = 0S/dx, etc., and Sr = 0S/0R. Use Eq. (12.129) to show that 
S202 + S202 
) ee) ee PE SS! | REY <4 g°2 nw 
On = ONin;j = 2452 + 52 = 0;, (13.170) 


and conclude. 

In the same spirit as the second identity of Eq. (13.102), show that if the density and 
potential are assigned in terms of r and R, then the commutator in Eq. (13.103) can 
be written as 


_ dp ddr _ 9p Agr) 1 dp ddr _ 9p Abr) |, 
= a 
: aR oz az oR) J, aR or’ dr’ OR 


(13.171) 


Given three generic functions of R and z, show that the commutator introduced in 
Eq. (13.103) obeys the rules of Lie algebra (see also Section 9.3.1). Moreover, from 
Exercise 13.24, show that for generic spherically symmetric functions u(r) and v(r) 
and generic f(R), in the untruncated case 


[f(R)u(r), v(r)] = rr [ u(x)u'(x)dx, (13.172) 
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where v/(r) = dv(r)/dr. Finally, show that for f(m) with m2 = R?/a* + 
z’/(@’a’), 


I R = f'(m) / 
[fam),vr)] = (1 :) al v' (x)dx, (13.173) 
qd av J; m 


where the integral is evaluated at fixed R and m in the integrand is intended to be 
expressed in terms of the spherical radius x (see also Exercise 13.30). 


For an axisymmetric stellar system with f = f(€, Jz), by using Eq. (13.109) and 
the definition of A, in Eq. (13.101), show that 


o2(x) + Dp? (x) = 02 (x) + Ag (x) sin’ g sin? i (13.174) 


(i.e., the sum in Eq. (13.174) is independent of the specific decomposition adopted 


to break the degeneracy of azimuthal motions 7 = er + 04). Finally, rewrite 
Eq. (13.109) in the case of the Satoh decomposition shown in Eqs. (13.104) and 
(13.105). 

The (axisymmetric) constant-density ellipsoid of flattening g, is obtained from Eq. 


(13.115) with n = 0. From Eqs. (13.59), (13.116), and (13.117), first show that 
4qzp(0)a} 20(0 
M= ee 5) = 2? Ae, 0, ~ 0.608, (13.175) 
a 


where @ and @ are given in Eqs. (13.145) and (13.146). Then, from Eq. (13.120), 
show that the potential inside the ellipsoid is 


#(R,2) = —Go Oat [aig.wo(0) — w1 R? — w3?"], (13.176) 


where wo(0), w, = w2, and w3 are given in Eqs. (2.67), (3.62), and (3.63). By 
integration of Eqs. (13.102) and (13.103), show that 


0? =Gp(O)aig;w3(1— m7), Ag =2xGp(O)aj(wi — q2w3)R*, (13.177) 


and verify that wy — gq2w3 > O for 0 < q, < 1, with A, = 0 for g, = 1. For the 
projected fields in Eqs. (13.109) and (13.114), prove that in the Satoh decomposition 


Thos €.1) = —ky/ 20 Gp(0)a?(w1 — g2ws) & sini, 


2 2 
— w3(1 — £2) be a 
FF 9566.1) = 2x Gp (Oat E ; 3 + (1—k?)(w, — q2w3)é vi] : 
(13.178) 
and explain why Us = Crist so that from Eq. (13.112) also OP og = a. 


Finally, evaluate the aperture velocity dispersion over a generic isophotal ellipse @ 
from Eq. (13.114), and with the help of Eq. (13.117) show that 
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2 mGpO)ay Ay + (1 —k*) Ap sin? i 
of = 3 ide (13.179) 


where 


Ay = 2q2ws3[l — (1 — €2)9/7}, Ap = (wy — g203) [2 204P 3h 1= e] 
(13.180) 
See also Lanzoni and Ciotti (2003). 


With this exercise, we illustrate the importance of the relative shape of the density 
and potential on the positivity of A, in Eq. (13.103). Consider for simplicity an 


untruncated stellar system with density profile p,.(m,) and pi.(ms) = dpx/dmy < 
0. Let ép(mp) be an ellipsoidal potential (see also Exercise 2.9) with $j, (mn) = 
ddy(mp)/dmy = 0, and finally 
R2 22 R2 2 
2 2 
m= >+a Ss Mh oy (13.181) 
ab” alga Paka 
Show that 
1 1 )\ R? £% [ph (m) |b, mn) z 
pda =(=-—=) 5 / Eee (13.182) 
G 4G) aay Je mM,Mh 


and discuss the sign of A, as a function of the two flattenings g,. and gp (see Barnabé 
et al. 2006 for further discussion in the framework of fluid dynamics). Obtain the 
analogous expression for peor. 

With this exercise, we address the problem of the sign of A, in Eq. (13.103) for self- 
gravitating (axisymmetric) ellipsoidal models. Let » = p(m), with m2 = R?/a? + 
z?/(a*q7) being an untruncated ellipsoidal model, so that the oblate case is obtained 
for 0 < q < 1 and the prolate case is obtained for g > 1. Show that in the self- 
gravitating case 


2n Ga(1 — q?)R* ia |ol.(m)|z dz - TPx(Mz) dt 
Zz 0 


q JAC) (a? + t)(a2q? +7)’ 
(13.183) 


Px Ao = 
m 


and obtain the analogous expression for pee. Conclude that it is possible to have 
an oblate isotropic rotator, but that this is impossible in the prolate case. 

Show that for the axisymmetric Jaffe stellar density profile in Eq. (2.83) with y = 2, 
aq, = ay, M = Mx, po = M,/(41a3), and in the spherical Jaffe potential in 
Eq. (13.39) with y = 2, M = RMy, re = Eax, Uo = GM,/a,, Eqs. (13.102) 
and (13.103) can be expressed in terms of elementary functions. In the oblate case 


> po%n»R Y° [ dt 
Co, = — : 
: Ge arccoshe (X + cht) (Y + sht)? shr cht 


(13.184) 
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WIRY pe Y + 2shr)dt 
prdg = OO f eee a, GS) 
qx arccoshe (X + cht) (Y + sht)3 sh° t cht 
where 
E qx S 
x= = Y= = C= =, (13.186) 
V1—q?R Vl—42R V1—q?R 
2 s? 1 ~ > r ~ R 
dx qx. a a 


and ¢ > | over all of the space. Three interesting limit solutions are obtained for 
qx —> 1 (the isotropic spherical limit; see Ciotti and Ziaee Lorzad 2018; Ciotti 
et al. 2020), the BH-dominated case (€ — O and RM, = Mgy), and the SIS 
potential® (R = uzé / WY, and € > oo). Hints: From the previous exercises, we limit 
ourselves to the oblate case. Express the ellipsoidal coordinate m in terms of r as in 
Eq. (13.187); notice that this transformation holds for generic ellipsoidal models in 
a spherical potential. Then use Eq. (13.173). In order to obtain the explicit form of 
the integrals, a standard approach would be to transform the hyperbolic functions in 
their exponential representations to reduce the integrand to a rational function and 
to use the Hermite partial fraction decomposition. 

13.31 Consider the stellar oblate axisymmetric power-law ellipsoid in Eq. (5.64) with g, = 
qx and m = m,. Following Exercises 13.28—13.30, show that the two-integral Jeans 
equations can be solved in closed form in the case of a potential dominated by a 
central BH, or by the SIS of circular velocity ve in Eq. (2.63), with 


wyytl _ 
pa gs 2), Oe 
2 ) 
2 2q2 2 2 c 
PxOZ = po X 3 (13.188) 
ugY” y2-y 1 
B( =, ; , (SIS) 
24x 2 2 °¢? 
wiyytl 
Mn Yp ver Ye - BH 
2q2 2 2° ¢2 
pxAo = (0 X 2yy 4 > 1 (13.189) 
U Y Y Y. 
on B ( gt =) , (SIS) 


where B(a;x,y) is the incomplete beta function in Eq. (A.55) and Y and ¢ are 
defined in Eq. (13.186). The nearly spherical case can be obtained by expansion 
for qx — 1 (e., € — oo) and Eq. (A.56), or by mass-unconstrained homeoidal 
expansion of the power-law ellipsoid (see also Ciotti and Bertin 2005; Riciputi et al. 
2005). 


6 The sum of these two limit cases will give the solution of the Jeans equations for the ellipsoidal Jaffe model in a total potential 
of the SIS plus a central BH. Note that analogous explicit solutions can be obtained for the ellipsoidal Hernquist model. 
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13.32 Solve the Jeans equations for the power-law torus in Exercise 2.14 under the 


13.33 


assumption of an f (€, Jz). Show that 


2 = An Gpnr2 — a + R? (13.190) 
Oo, = r 3 - 
z Me oe | eo sw Ted) 
Ag =4nGpnr2 ed Zia R? (13.191) 
= 470 . . 
: Pals ae DO —e) |(@—-DG—a) e—1 


What happens in the isotropic case? Compare v2 and oF in the equatorial plane (see 
Ciotti and Bertin 2005 and Exercise 5.7). Repeat the exercise adding a central BH 
of mass Mpu. 

We recall three of the most important stability criteria for stellar systems, mentioned 
at the end of Section 6.2; the inequalities mark stability. The radial orbit instability 
indicator for spherical stellar systems can be written as 
2 Krad = 4 2 
Kan 2+ Wr/ Krad 
where Kyaq and Kian are, respectively, the kinetic energies associated with the radial 
and tangential components of the velocity dispersion tensor of the density compo- 
nent of interest, and the last expression is obtained from the virial theorem (prove 


it!). The (local) Toomre stability criterion for self-gravitating stellar disks can be 
written as 


169) 


ie (13.192) 


ORKR 
3.36GX 
where op is the radial velocity dispersion in the disk, kr the radial epicyclic fre- 
quency, and finally & is the disk surface density at radius R. The Ostriker—Peebles 
stability criterion for axisymmetric systems is given by 


Q(R) = 


> 1, (13.193) 


_ Kr 
|Wr| 


where Kg; is the kinetic energy associated with the streaming motions (sometimes 
called “ordered” motions) of the density component of interest. 


< 0.14, (13.194) 
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In this last chapter, we discuss a final theoretical step of the moments approach illustrated in 
Chapter 13: under the assumption that the macroscopic profiles (e.g., density and velocity 
dispersion) of each component are known, there is a possibility of recovering the phase- 
space distribution function (DF) of a model and checking its positivity (i.e., verifying the 
model consistency). The problem of recovering the DF is in general a technically difficult 
inverse problem, and even when it is doable, unicity of the recovered DF is not guaranteed, 
so that a simple consistency analysis is quite problematic. Fortunately, there are special 
cases when (in principle) the DF can be obtained analytically (generally in integral form), 
and in these cases a few general and useful consistency conditions can be proved, such as 
the so-called global density slope—anisotropy inequality (GDSAI). The student is warned 
that this chapter is somewhat more technical than the others; however, the additional effort 
needed for its study will be well repaid by the understanding of some nontrivial results 
allowing for the construction of phase-space consistent collisionless stellar systems. 


14.1 Recovering the DF 


As we have seen in Chapter 13, in the study of stellar systems based on the moments 
approach, the density distribution is given and specific assumptions on the internal dynam- 
ics of the model are made. In practice, one usually starts from the density profile and then 
solves the Jeans equations due to more or less motivated closure relations. Unfortunately, 
the fact that the Jeans equations admit a solution that is not trivially unacceptable! is not 
sufficient to guarantee that the model is viable: the minimum requirement to be met is that 
the DF of each physically distinct component must be positive definite. A model satisfying 
this requirement is called a consistent model. If the DF can be recovered, a positivity check 
should be performed, and as discussed in Chapter 9, in case of negative values the model 
must be discarded as unphysical, even if the kinematical profiles look satisfactory. 
Remarkably, under special assumptions about the geometry/internal dynamics of the 
models, inversion formulae exist so that the DF can be obtained, usually in integral form or 


| For instance, a somewhere negative squared velocity field; for example, see Eq. (13.103) and the subsequent comments. 
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as a series expansion (see Binney and Tremaine 2008; see also Cuddeford 1991; Dejonghe 
1986, 1987a; Eddington 1916; Fricke 1952; Hunter and Qian 1993; Lynden-Bell 1962a; 
Merritt 1985a; Osipkov 1979 and references therein). However, in these fortunate cases, 
the difficulties inherent in the operation of recovering analytically the DF also prevent in 
general a simple consistency analysis, and numerical inspection of the inversion integral 
is required. Fortunately, at least in some special cases, criteria for phase-space consistency 
that can be applied without an explicit recovery of the DF are known. Finally, it is important 
to realize that phase-space consistency is a much weaker requirement than model stability 
(see Section 6.2), and consistent but unstable models should not be accepted as viable 
equilibria to describe real stellar systems. 

In the first section of this chapter, some of the known inversion formulae that can be used 
to recover the DF in special cases are illustrated, and in the second section the associated 
consistency criteria are proved and discussed. All of the presented results (if not explicitly 
stated) apply to each density component px of multicomponent systems, and we maintain 
as far as possible the nomenclature introduced in Chapter 12; however, in order to avoid 
clumsy notation, the subscript “k” used to label each component is not reported. 


14.1.1 Multicomponent Isotropic Spherical Models 


We start by presenting the seminal result from Eddington (1916), namely the inversion 
formula for the DF of spherically symmetric, globally isotropic stellar systems. Following 
Chapter 12, in a multicomponent spherical system, let 


f =h(E)O(E — &) (14.1) 


be the DF of the isotropic k-th density component = px(r) immersed in a total potential 
Wr(r), where @ is the Heaviside step function in Eq. (A.99) and & is the component’s 
truncation energy. Then 


1 d f& dp dr 
/8r2 dE Et dr JE — Uy 


ot [ dp dW dp/dW¥T|y.=¢, 
V8r2 Je, dW2VJE- Wp 892. /E—&’ 


where it is assumed that the density p of the k-th component is written as a function of 
the total potential Wy (i.e., o = p(Vr7)). In principle, this is always possible, because from 
Newton’s second theorem in spherical systems the potential is a monotonic function of the 
radius, being dWr/dr = GMy(r)/r?. 

The proof of Eq. (14.2) is simple. From Exercise 12.1, p(Wr) = 42 Ee /2(U7 — E) 
h(E)dE; after differentiation with respect to Wy, the integral is cast in a form that is suitable 
for the Abel integral inversion in Eq. (13.1), and so the first identity is proved. The second 
identity is obtained by integrating by parts from the first identity with respect to Vy, and 
then by differentiation with respect to €. Note that for untruncated regular systems of finite 


h(é)= 


(14.2) 
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total mass (or more generally when the total potential at infinity can be set equal to zero; see 
Chapter 2), by fixing € = 0, the last term in Eq. (14.2) vanishes (see also Exercise 14.1). 


14.1.2 Multicomponent Osipkov—Merritt Spherical Models 


A generalization of Eq. (14.2) holds in the case of multicomponent spherical systems with 
radial Osipkov—Merritt (OM) orbital anisotropy (Section 12.2.2). In this case, we assume 
that the k-th component of a multicomponent spherical system is supported by a DF of the 
form 


JZ 
f=h(Q)0(Q-O), Q=E-—,, (14.3) 
ar; 
where Q, is a truncation value; note that in principle different components of the system 
can have different values of the anisotropy radius and different functions h, and consistency 
requires that each h is nowhere negative. It is easy to prove that 


h(Q) = an fa 
a dQ d¥y JQ — Vr 
__! [ dq dWr 2 do/d¥tly,=0, (14.4) 
V8n2 Jo, dv2/O—Vr  V8n? /JO-O, 
where from Eq. (12.45) 
p(r) -- 
4 (1+) om. — 


We call this function the OM augmented density. In fact, the proof of Eq. (14.4) is identical 
to that of the isotropic case once we introduce the augmented density into Eq. (12.42). For 
some explicit examples of one- and two-component systems that can be solved analytically, 
as well as in the presence of a central black hole (BH), see, for example, Exercise 14.2; see 
also Binney and Tremaine (2008), Ciotti (1996, 1999), Ciotti and Ziaee Lorzad (2018), 
Ciotti et al. (2009), Hernquist (1990), and Jaffe (1983). 


14.1.3 Multicomponent Cuddeford and Generalized Cuddeford 
Spherical Models 


In the case of a density component » = px in a multicomponent spherical Cuddeford 
(1991) system, from Eq. (12.53) the associated DF is given by 


f =J*n(Q)0(@-O), a>-l, (14.6) 


where again we consider the possibility of truncation; in general, the values of a, rg, and 
Q, will be different for different components. 
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From Eq. (12.54), the augmented density can be written as 
pir) _ 2 Te@+3/2) (PV pw) 
A(r,a) f/x Pat) r2 2a? 


and an inversion formula (reducing to the radial OM case for a = 0) can be obtained as 


or) = (14.7) 


follows: after 
1 
m = int (« + 5) +1 (14.8) 


differentiations of Eq. (12.54) with respect to Wy, the resulting identity (prove it!) can 
be Abel inverted. In practice, one must perform enough differentiations as to produce a 
negative exponent aw + 1/2 — m > —1 in the power-law kernel; note that, as a > —1, we 
have m > 0. 

When @ + 1/2 is not integer 


(—1)"*! cosam F(a + 3/2 — m) d ie do dW iG, 

2/872 (a + 3/2) dQ dU (Q — Wy yet3/2-m’ 
and in the particular case of untruncated systems with finite total mass, an integration by 
parts show that 


2 go dWy 4 ee dV eT 
= dvr (0 qe Jy. dart (= tage aes 


As expected, Eqs. (14.9) and (14.10) reduce to the OM case for a = 0. When a = n — 3/2 
with n being an integer > 1, then m = n, and the solution of the resulting Volterra integral 
in Eq. (14.9) is given by 


h(Q) = 


1 d™o 
2/8x(m — 1)! dT yo | 


and so in this very special case the DF is recovered analytically while avoiding integration!? 
As anice exercise, the student is encouraged to work out the explicit expressions of the aug- 


h(Q) = (14.11) 


mented density and of the inversion formula for a generic component of multicomponent 
generalized Cuddeford spherical systems, whose DF is given in Eq. (12.64). 


14.1.4 Purely Radial Spherical Models 


We finally discuss the inversion problem for the special but conceptually relevant case 
of spherical systems made of purely radial orbits. In this case, we consider a density 
component p = px with the DF 


f =h(E)8(J*)0(E — &), (14.12) 


2 int(x) is the largest integer < x. For example, int(1/2) = 0, and som = | for OM models, while m = 0 for Cuddeford models 


with —1 <a@ < —1/2. 
3 In eq. (30) of Cuddeford (1991), the (m — 1)! in the denominator is missing. See also eqs. (49) and (51) of Baes and Dejonghe 
(2002). 
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where the Dirac 6-function guarantees that only radial orbits are present. The integral 
equation to be inverted is given by an equation analogous to Eq. (12.35), where & is 
now the lower limit of integration. Noticing that, at variance with the Eddington, OM, 
and Cuddeford inversions, the preparatory differentiation with respect to the potential is not 
required in order to apply the Abel inversion theorem, and from Eq. (13.1), we immediately 
obtain (prove it!) 


1 dé o 
J2n?2 dE & VE — Wy 


il [ dg _d¥r_, off) 
~ J2n? Je, Wr JE— Wy 2? SEE 
(e.g., see Ciotti and Ziaee Lorzad 2018; Oldham and Evans 2016; Richstone and Tremaine 
1984), where again the augmented density 9 = rp is intended to be expressed in terms of 
the total potential Yr. As we will see later on, the seemingly minor difference of the missing 
preparatory differentiation has quite important consequences for system consistency. 


(14.13) 


14.2 Axisymmetric and Triaxial Models 


The problem of recovering the DF for axisymmetric systems under the assumption of 
F(E, Jz) (ue., the inversion of Eqs. (12.87)-(12.89)) is much more difficult than that of 
the spherically symmetric case. In fact, note that in all of the previous formulae the density 
must be expressed in terms of the total potential. As remarked above, in principle this is 
always possible in spherical systems. In axisymmetric systems with the vertical component 
of the gravitational field attracting everywhere toward the equatorial plane, it is instead 
possible to eliminate z between p(R,z) and Wr(R,z), and so an expression of the form 
p = p(R, Vr) can be derived. Basically, the known methods of inversion can be listed (in 
a broad sense) as follows: 


(1) Fricke method: If p(R, Vt) is expressed as a finite or infinite series of terms R® we, 
then Eq. (12.100) can be used to determine the coefficient A for each term of the 
sum (Fricke 1952; see also Kalnajs 1976b; Miyamoto 1971, 1974, 1975; Nagai and 
Miyamoto 1976). The resulting DF is given by a sum or a series of functions of the 
type in Eq. (12.99). The proof of the convergence of the obtained series is usually 
the most difficult step of this procedure, and sometimes it may even require analytic 
continuation. An example of the application of the method is given by the power-law 
torus with w = 3 presented in Ciotti and Bertin (2005) in the study of homeoidal expan- 
sion (see also Exercise 2.14), with a two-integral DF expressed with hypergeometric 
function; we also recall the family of power-law tori studied by Toomre (1982), one 
of the extremely few cases of axisymmetric systems with elementary DFs (however, 
this was not discovered by using the Fricke method). Finally, a different method, 
but still based on a suitable expansion of the density, is given in Dehnen and Gerhard 
(1994). 
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(2) Integral transforms: The inversion procedure is based on classical integral transforms, 
and here we recall the Lynden-Bell (1962a) method based on Laplace transforms, 
the Hunter (1975) method based on Stieltjes transforms, and the Dejonghe (1986) 
method based on Laplace—Mellin transforms (the interested reader is invited to consult 
Dejonghe (1986) for a very detailed account of integral transform methods). All of 
these tools require the analytical continuation in the complex plane of the function 
p = p(R, Wr); therefore, they are not meant to be directly applied to observational 
data.* Very important examples of the Laplace transform method are the recovery of the 
elementary two-integral DF for the flattened Plummer sphere presented in the seminal 
paper by Lynden-Bell (1962a), and by the unexpected discovery made by Evans (1993) 
of the surprisingly simple two-integral DF of the Binney (1981) logarithmic potential 
in Eq. (13.66), successively extended in Evans (1994) to the power-law models in 
Eq. (13.68). 
(3) Hunter—Quian method: This method is based on the theorem of residues for functions 
of complex variables. The requirements on the function p(R, Yy) to be continued to 
the complex plane are much weaker than those for the integral transforms. The papers 
by Hunter and Qian (1993) and Qian et al. (1995) not only give clear expositions of the 
method, but also provide quite complete summaries and several additional references 
of the inversion methods to recover two-integral DFs. 
Stdickel models: A special class of systems amenable to inversion (as well as in the 
three-dimensional case; i.e., when the DF depends on three isolating integrals of 
motion) is represented by separable Stackel models (see Section 13.2.3), and for the 
inversion formula, see, for example, Dejonghe (1987a). 


(4 


wm 


All of these results are quite technical and well beyond the introductory level of this 
book; for a much more complete discussion of the inversion methods (as well as consider- 
ing quite sophisticated numerical techniques), the student is directed to consult Binney and 
Tremaine (2008). 


14.3 Testing the Consistency 


As was stressed at the beginning of this chapter, in some special cases, it is possible 
to investigate the positivity of the DF without explicitly recovering the DF itself. In the 
following, we present a summary of the main results regarding the consistency of spherical 
models that are simple enough to be used without major effort. 


14.3.1 Multicomponent OM Models 


We begin by discussing in detail a criterion (Ciotti and Pellegrini 1992) that allows us to 
check whether the DF of a multicomponent spherical system immersed in a total potential 


4 From Exercise 2.9, the student should prove that for a potential stratified on ellipsoidal surfaces #(m) with ¢/(m) 4 0, in the 
axisymmetric case the corresponding density can be always written as p = A(d) + R2B(o). 
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Wr, for which the orbital anisotropy of each component is of the radial OM form, is indeed 
positive without an explicit calculation of it. Of course, the isotropic case is included in the 
following results when considering the limiting case of rg > oo (the analogous discussion 
of the less natural tangentially anisotropic OM models is left as an useful exercise for the 
student). As in the previous sections, if not strictly needed, we do not indicate explicitly the 
subscript k in the density of the specific density conponent p ;, in the associated anisotropy 
radius rg, and in its potential Y,, while Ur = >> ; VY is the total potential.° 
First, we show that a necessary condition (NC) for h => 0 in Eq. (14.4) is 
ae >0 Q<V¥< WV). (14.14) 
dv ~ 
In fact, from Eq. (12.42), we have do/dWy; = 4x i h(Q)dQ//2(V7 — Q), and if 
h > O, one obtains the inequality above, being do/dWy; = (de/dV)(dWy/dW)—', and 
the last factor is necessarily positive from Newton’s second theorem.® 
For systems with Q; = 0 (the common situation for spatially untruncated systems of 
finite total mass) for which the NC is satisfied, a strong sufficient condition (SSC) for 
h > Ois 


cn ct al ” fe >0, 0< VU < VO) (14.15) 
dw | dv \ aw Been ee 


and a weak sufficient condition (WSC) for h => 0 is 


d | do (dWy\"! 
ay ee (ate >0, 0< WV < W(0). (14.16) 
dw | dw \ dw 


The proof of the WSC is obtained directly from the second identity in Eq. (14.4) for 
spatially untruncated systems, while the proof of the SSC is slightly more complex, and 
it is deferred to Exercise 14.3. 

As nice as the previous results may be, we must accept that only for very few special 
systems it is possible to obtain the explicit function e(W); fortunately, we can reformulate 
Eqs. (14.14)-(14.16) in terms of the radius instead of the potential, and in Exercise 14.4 
the corresponding formulae are derived. Some remarks are also in order. The first is that 
the violation of the NC is related only to the radial behavior of p and the value of ra, and 
so this condition applies independently of whether any other component is added to the 
model. This condition is only necessary; thus, f can be negative even for values of model 
parameters allowed by the NC. A model failing the NC is certainly inconsistent, while 
a model satisfying the NC may be consistent. Similarly, a model satisfying the WSC (or 
the more restrictive SSC) is certainly consistent, while a model failing the WSC/SSC may 
be consistent. As a result, the consistency of a model satisfying the NC and failing the 


5 The possible presence of an external potential Yext is considered for simplicity in the sum. 
6 The function Wy(W) can always be obtained (in principle) by elimination of the radius between Wp (r) and W(r) because the 
potential in spherical systems is monotonically decreasing and so invertible. 
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WSCY/SSC can be proved only by direct inspection of its DF. The second remark particularly 
concerns the limitations on r,. In fact, for each component of OM systems, one can write 


ha(Q) 


2? 
ur 


h(Q) = hi(Q) + 


(14.17) 


where the meaning of the two functions /; and ha is obvious from Eqs. (14.4) and (14.5). 
Importantly, note that a similar functional decomposition holds for the NC and the WSC, 
so that all of the following arguments can be repeated for these two conditions with the 
variable Q substituted by r, and h substituted by the augmented density a. 

Let us now discuss the positivity of h(Q) in Eq. (14.17) in terms of rg, the anistropy 
radius. We indicate with A the set of values of Q such that hj > 0 (.e., the set of values 
of Q so that the isotropic component of the DF is positive definite). Then, from Eq. (14.17) 


ra >, = |max | 0, sup |- ad | (14.18) 
i QeAy 


is the condition to be satisfied in order to have a positive definite DF over A+. Obviously, 
when h; > 0 over all of the phase space (e.g., when the isotropic component is consistent), 
A coincides with the total range of variation for Q and r, is the lower bound for the 
anisotropy radius; this is the common situation encountered in the vast majority of one- 
component systems. However, when the set A_ (complementary to A+) is not empty (i.e., 
h, < 0 over some region of the accessible phase space), a second inequality, derived from 
the request of h(Q) > 0 in Eq. (14.17), must be verified: 


ra <r} = _|inf | (14.19) 
7 hi(Q) QeA_ 


It follows that if A_ is not empty and h, < 0 for some Q € AL_, or me <r, then 
the density component under investigation is inconsistent. In summary, the allowed region 
for phase-space consistency for each component of single-component or multicomponent 
spherically symmetric and radially anisotropic OM models is r> < ra < rj. 

As remarked above, formally identical considerations can also be repeated for the NC, 
SSC, and WSC, where of course the necessary or sufficient nature of the obtained lim- 
itations must now be taken into account (see Exercise 14.5; see also Ciotti 1996, 1999; 
Ciotti and Lanzoni 1997; Ciotti and Morganti 2009; Ciotti and Pellegrini 1992; Ciotti and 
Ziaee Lorzad 2018; Ciotti et al. 1996, 2009, 2019; Tremaine et al. 1994 and references 
therein for applications of the NC, WSC, and SSC conditions to single-component and 
multicomponent OM galaxy models also in the presence of a central BH). 

Two important consequences derive from the NC radial condition. First, in the isotropic 
case (fa —> 00), the NC for consistency in Eq. (14.38) reveals that » must be a 
radially decreasing function; for instance, it is impossible to have isotropic spherically 
symmetric systems with a “hole” or a “depression” in the center. Second, in the fully 
radial case (rg — 0), the NC requires that the function r2 p(r) must be nonincreasing with 
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increasing radius. However, as we have seen in Section 14.1.4, the fully radial case can be 
also discussed directly from its inversion integral, and Eq. (14.13) shows that 


dtr*p(r)] 
dr 

is also a sufficient condition for consistency. Therefore, we have reached the interesting 
conclusion that the necessary and sufficient condition for a spherical model to be made by 
purely radial orbits is that its density declines everywhere with the radius at least as fast as 
1/r?. Again in relation to purely radial models, it is useful to mention an explicit example 
illustrating some of the subtleties that can occur when considering phase-space inversion. 
In fact, Ciotti and Ziaee Lorzad (2018) showed that, in the OM case, the (analytical) DF 
of Jaffe models (e.g., see Binney and Tremaine 2008; Merritt 1985b), for rg below a (quite 
small) critical value, is negative for some admissible value of Q, and so one would feel 
authorized to conclude that a fortiori the purely radial (r, = 0) Jaffe model is inconsistent. 
On the other hand, the Jaffe density profile (being everywhere steeper than 1/r*) obeys the 
sufficient condition for consistency of purely radial models in Eq. (14.20), as is confirmed 
by inspection of its nowhere negative analytical DF (Ciotti and Ziaee Lorzad 2018; Evans 
et al. 2015; Merritt 1985b) obtained (do it!) from Eq. (14.13). This fact clearly illustrates 
that the purely radial case obtained from OM anisotropy can be a singular limit; in practice, 
if the limit for rz — 0 of a given OM model is consistent, then it is possible to conclude 
that the purely radial model exists, but the opposite is not necessarily true. 


<0, O0<r<ow, (14.20) 


14.3.2 Multicomponent Generalized Cuddeford Models 


From the results in Section 14.1.3, it should be clear that the same arguments used to derive 
the necessary and sufficient consistency conditions for OM models can be repeated for mul- 
ticomponent generalized Cuddeford systems. However, as m preparatory differentiations 
with respect to Wr must be performed before inversion of the integral’ in Eq. (12.54), with 
each differentiation producing an integral identity, it is easy to see that we now obtain m 
necessary consistency conditions, plus a sufficient condition (Ciotti and Morganti 2010a). 
In fact (prove it!) each density component in a consistent multicomponent generalized 

Cuddeford system with a > —1 obeys m necessary conditions (NC;) 

d! fe) 

—>0, /=1,...,m, (14.21) 

dwt, 
where m is given by Eq. (14.8) and o(r) = p(r)/A(r,q@) is the augmented density. More- 
over, under the assumptions of Eq. (14.9), a sufficient condition for the non-negativity of 
the DF in the untruncated (Q; = 0) case is 


rae) 


LS (14.22) 
1 
dwt 


7 Of course, now the function A(r,q@) is given in Eq. (12.65). 
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Of course, for a = n — 3/2 and integer n > 1 (when m = n), the first m — | inequalities in 
Eq. (14.21) are still NCs, but 

a"e 59 14.23 

aun = (14.23) 
is the necessay and sufficient condition for consistency, as follows from the closed-form 
inversion formula in Eq. (14.11); moreover, the student is invited to prove that the NC and 
the WSC for OM systems in Eqs. (14.14)-(14.16) are reobtained as very special cases for 
a=O(ie.,m = 1). 

In applications, it can be useful to express the family of Eqs. (14.21) in terms of the 
radius instead of the potential (see Exercise 14.4); for example, NC; and NC2 coincide with 
Eqs. (14.38)-(14.40), where of course now g is the augmented density of the generalized 
Cuddeford component. When expressing the conditions in terms of radius, notice that NC; 
is the sole condition in which only the augmented density profile of the specific density 
component appears, while in the higher-order NC; the total mass profile Mr(r) is also 
involved (see Exercise 14.6). Also notice that from the monotonically decreasing nature of 
Wr with the radius, the sign of the NC; inequality changes to (—1)!. 

Finally, we mention a result that (at the present stage of discussion) seems to be noth- 
ing more than a mathematical curiosity, and that was obtained by direct computation in 
Ciotti and Morganti (2009, 2010a) (see also Exercise 14.7): that for each component p of 
multicomponent generalized Cuddeford systems, if we define the logarithmic density slope 
y(r) = —d1np/dInr, then the NC, holds if and only if Vr 


y(r) 2 2B(r), (14.24) 


where f(r) is the anisotropy profile in Eq. (12.68). We will refer to the inequality in 
Eq. (14.24) as the GDSAT. 


14.4 The GDSAI 


From the GDSAI inequality (14.24), it follows that, at least for the quite large family 
of multicomponent generalized Cuddeford systems (containing as very special cases the 
isotropic, constant-anisotropy, Cuddeford, and OM models), the density slope of each con- 
sistent component is related to the maximum amount of anisotropy that can be supported at 
each radius by the component itself. A similar and complementary result, the central cusp— 
anisotropy theorem (An and Evans 2006; de Bruijne et al. 1996), was in fact already known: 
this theorem states that Eq. (14.24) holds (1) in all consistent constant-anisotropy systems 
with 6 < 1/2 and (2) asymptotically at the center (i.e., for r — 0) of any consistent 
spherical system with a generic anisotropy profile. Therefore, Eq. (14.24) unexpectedly 
revealed that the GDSAIT holds not only at the center, but at all radii in a quite large class of 
spherical systems, whenever their DF is positive. When considering the two previous results 
together, it is natural to ask whether the GDSAT is necessarily obeyed by generic spherically 
symmetric systems with positive DFs. This possibility is reinforced by the fact that the 
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end products of N-body numerical simulations appear to follow interesting correlations 
between 6 and y (e.g., see Hansen and Moore 2006). 

Quite interestingly, a (partial) answer to this conjecture can be given. In fact, Ciotti 
and Morganti (2010b) considered the special family of stellar systems in which the radial 
pressure p,; defined in Eq. (12.117) is a factorized function of the radius and of the total 
potential, so that from the first identity of Eq. (12.119) 


p(r, Ur) = A(r)B(Yr). (14.25) 


Of course, while the function B in the expression above is the derivative of the potential 
dependent factor in p;, the radial function A is the same (see also Baes and Dejonghe 2002; 
Baes and van Hese 2007). We note that the multicomponent generalized Cuddeford models 
(and therefore also the constant-anisotropy, OM, and Cuddeford models) belong to such a 
family of systems. From the second identity of Eq. (12.119), it follows that, independently 
of the specific radial dependence of Vy, 


(14.26) 


Moreover, it is a simple exercise (do it!) to show from the first identity of Eq. (12.119) 
and from Eq. (14.26) that the logarithmic slope of the density profile in Eq. (14.25) can be 
written as 


dino — dinB dinWry 
dinr = dinWy dinr 


y“)= + 2B(r), (14.27) 
so that we can express the quantity y(r) — 28(r) in terms of the logarithmic derivative of 
the 6 function. Now, since from Newton’s theorem Wr(r) is a monotonically decreasing 
function of the radius, Eq. (14.27) proves that in all spherical systems whose radial pressure 
is a separable function of the radius and total potential, so that p = A(r)B(Vr), the global 
GDSAI holds if and only if B(YT) is a monotonically increasing function of Yr. Therefore, 
if one is able to show that in all factorized consistent systems the B function is necessarily 
monotonically increasing, then the GDSAI is established at once for all of these systems, 
without the need for lengthy algebraic calculations. We will now illustrate the power and 
elegance of this result with some explicit examples. 


14.4.1 The GDSAI for Generalized Multicomponent Cuddeford Systems Again 


Following Ciotti and Morganti (2010b), we begin by considering again the case of gener- 
alized multicomponent Cuddeford models, where for a > —1/2 we obtained the GDSAI 
in Eq. (14.24) by explicit computation (see Exercise 14.7). From Eqs. (12.65) and (14.25) 


A(r) = (21)?! 


2a : 
2V(a+ Dr ‘ Wj (14.28) 


T'(a + 3/2) (Lt r2/r2yetl 


i 
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Ww 
B(Wr) =) “(wy — 9)*t'/2n(0)dQ, (14.29) 
0 


where differentiation shows that B is a monotonically increasing function of Yr whenever 
h > Oanda > —1/2, proving again in a very direct way the validity of the GDSAI for 
this class of models. We are left with the case —1 < a < —1/2, where Eq. (14.22) with 
m = 0 shows that the GDSAI is only a sufficient condition for consistency. Indeed, for 
—1 <a < —1/2, it is not possible to compute the derivative of Eq. (14.29) directly. How- 
ever, this problem can be circumvented by first integrating by parts and then performing 
differentiation, obtaining (prove it!) 
dB 


a = et h() + | (Wr — Q)**'7n'(Q)dQ. (14.30) 
T 0 


From Eq. (14.30), we conclude that the GDSAI is again necessary for phase-space con- 
sistency in all generalized multicomponent Cuddeford systems with —1 < a < —1/2 
and h'(Q) > 0. In summary, for a > —1/2 the GDSAI is necessary for consistency, for 
a = —1/2 it is equivalent to consistency (i.e., necessary and sufficient), and for —1 <a < 
—1/2 it is just sufficient, but also necessary if h'(Q) > 0. 


14.4.2 The GDSAI for the Baes and van Hese (2007) Anisotropic Models 


Again from Ciotti and Morganti (2010b), we now show that the GDSAI is obeyed by 
another quite large family of consistent models not belonging to the family of generalized 
Cuddeford models. Baes and van Hese (2007) considered the family of density profiles 


Bo—Boo 
# —2Bo pa\ 3 /wy\P ws\4d 
= — 1+—- — 1-—], 14.31 
emm(=) (+e) CR) O-m). an 


where 6 > 0,g < 0,5 > 0,0 < WY < Wo, and Wo is the value of the central relative 
potential. From Eq. (14.26), the anisotropy profile is 


= Bo + Boo(t/Ta)” 
PO TE Clty? 


so that Bo and By are the values of the orbital anisotropy parameter at small and large 


(14.32) 


radii, respectively; since by construction B(r) < 1, both Bo and By are < 1. Note that 
the requirement of finite total mass for the density distribution (14.31) (e., Yr) ~ I/r 
for r — oo) translates into the condition p + 28. > 3, and from the limitation on By, 
it follows that p > 1. Now, it is easy to show that dB(W)/dVW > 0 for0 < VW < Wo, 
when s > 0, q < 0, and p > 1, and thus the GDSAI is necessarily obeyed by these 
profiles as well. The analytical DF associated with these systems (Baes and van Hese 2007) 
is given by 


E p+ks—3/2 PP 
>> (=) A(E) gx (35) ; (14.33) 
k a 
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where the g; are expressed in terms of the Fox H functions (e.g., see Mathai et al. 2009). 
The anisotropic Hernquist model considered in Baes and Dejonghe (2002) also belongs to 
this family. 


14.4.3. The GDSAI for the Multicomponent Generalized Cuddeford 
and Louis (1995) Anisotropic Polytropes 


We finally consider the multicomponent generalization of the family of anisotropic 
polytropes (Cuddeford and Louis 1995; Louis 1993; see also Dejonghe 1987b for the 
anisotropic Plummer model in the family) considered in Ciotti and Morganti (2010b). The 
DF of each density component is given by 


JZ 
E,J?) = ET 0(E) Y wih), ki = 55, 14.34 
FES) ©) Dwi) er: (14.34) 
with different anisotropy radii raj and positive weights w;. In Exercise 14.9, we prove that 
1 7 1 [% h(k)dk r2 
= 2/2B(¢,~) ve? S* wind | nes 14.35 
p mB 4.5) Wy win ) bem "=2 (14.35) 
3 1/2 1 [% h(k)dk 
= 278 ( 3) war? win! / —— 14.36) 
Pr q 5 T dX inj eae ( 


Note that the convergence of the density integral requires g > 0 near € = 0. As p, and p 
are in factorized form and B « wel the GDSATI is satisfied whenever g > 1/2. The 
situation is less straightforward in the interval 0 < g < 1/2. In fact, if a consistent model 
exists in this interval, then it will represent a case of violation of the GDSAI, similarly to 
the case of the models in Section 14.4.1 for —1 < a@ < —1/2. Therefore, it is natural to ask 
whether it is possible to construct a consistent dynamical model with g > 0, but violating 
the GDSAI (q < 1/2). Note that in the self-consistent case of finite total mass (when 
Wr = V ~ 1/r forr — ov), volume integration of Eq. (14.35) obtained with inversion of 
the order of integration shows that g > 3/2 is necessary in order to have a finite mass for the 
component under scrutiny. Therefore, the GDSAI is a necessary consistency condition for 
anisotropic polytropes in generic external potential when g > 1/2, and for all finite-mass 
self-consistent models. 

The possibility remains open for the existence of infinite-mass, self-gravitating, and 
consistent anisotropic polytropes with 0 < q < 1/2, or of Cuddeford models with angular 
momentum exponents in the range —1 < a < —1/2, which would violate the GDSAI. 
Therefore, we can conclude that there is the possibility of the existence of consistent models 
with factorized densities as in Eq. (14.25) but nonmonotonic 6 functions that violate the 
GDSAL In fact, recently it has been proved that these (quite peculiar) systems do exist, and 
for recent progress on this interesting problem, the student is invited to consult An et al. 
(2012), Barber and Zhao (2014), and Van Hese et al. (2011). Of course, much less is known 
for models with a nonseparable density profile. 
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Exercises 


The interested reader should prove the following statements about the Eddington 
inversion of spherical, isotropic systems: (1) for all spatially untruncated systems of 
finite total mass, limy,—.9 dox/d¥r = 0, and so one term in the second identity 
in Eq. (14.2) is absent; and (2) if hy ~ (E — &)* forE > &, thena > —1 to 
ensure the convergence of the density px at the edge of the system. When a > —1, 
then limy,—¢, ex = 0. Moreover, dox/d¥r ~ (Wr — E,)¢+!/2, and so lim,-+;, = 
dpx/dr = —o (for —1 < a < —1/2), dox/dr, = —const. (for a = —1/2), and 
finally dox/dr, = 0 (for a > —1/2; 1.e., the mass density goes smoothly to zero at 
the edge of the system). 


With this illustrative exercise, we recover one of the simplest explicit cases of the 
OM phase-space DF. Consider the single-component Plummer model in Eq. (13.36) 
and define M, = M/ 7 and Vy = GM/r¢. Integrate Eq. (14.4) in the untruncated 
case (Q; = 0) and show that f = (M,/W;/*)h(q), where 


24./2q7/? — 3q3/2(7 — 16q? 
a/2d qrie( q°) ae. (29=— <1. (14.37) 
n 


h(q)= : 
‘ i 7/273 52 re 


Prove the SSC in Eq. (14.15) for the k-th component of a multicomponent OM 
system. Hint: Let 1(Q) = SP g(a — x)~'/?dx, with x = Wy and g(x) = 
dox/d¥r; we look for a sufficient condition to have 1(Q)’ = dI(Q)/dQ > 0. 
Change the integration variable to x/Q and, after differentiation with respect to 
Q, restore the original integration variable, obtaining /(Q)’ = Qu! i [g(x)/2 + 
xg(x)'](Q —x)~'/*dx = Q7! Jor lee)/x¥ J/x(Q —x)~!/*dx. The SSC is proved 
by requiring positivity of the derivative in the integrand and finally setting Yr = 
Wr(Y;) (see Footnote 6). 


By changing variable from potential to radius, show that Eqs. (14.14)-(14.16) can be 
rewritten as functions of the sperical radius r, respectively, as 


do(r) 


20. 02727, (14.38) 
dr 
d fd 2 
OTA) 6 Ge p25, (14.39) 
dr dr My(r) 
2 
ee | ee See. (14.40) 
dr dr Mr(r) 


With this exercise, we elaborate further on the Plummer model, and we compute 
explicitly the critical values of the anisotropy radius for consistency in the case of 
OM anisotropy. By using the DF in Eq. (14.37) and the conditions in Eqs. (14.38)— 
(14.40) or in Eqs. (14.14)-(14.16), show that the minimum values of Sa = Ta/Tc 
obtained from the NC, the DF (the true limit), the SSC, and finally the WSC are 
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s, (NC) = ie < s, (DF) = : < s, (SSC) = 1 < s, (WSC) = i (14.41) 


14.6 Quite often the consistency is studied for multicomponent systems (e.g., a stellar 


14.8 


14.9 


distribution embedded in a dark matter halo and/or with a central BH). Show that in 
OM models a sufficient condition for the validity of the WSC(p, Yr) for a specific 
density component in the total potential Wy is to have the WSC(p, Y) verified 
separately in the potential Y; of each component. Hints: Write Mr(r) = )°, Mk(r) 
in Eq. (14.40), evaluate the outer derivative, and show that 


MP (r) 
Mi(r) 


WSC(p, Yr) = 
k 


WSC(p,%),  O<r <0. (14.42) 


Prove by direct evaluation that the GDSAI in Eq. (14.24) holds for each component of 
generalized multicomponent Cuddeford systems with a > —1/2. Hint: First notice 
that fora > —1/2 from Eq. (14.8) m > 1, and so NC exists. Then construct the 
augmented density 9 = p(r)/A(r,a) by using Eq. (12.65) and show that NC; in 
Eqs. (14.21) and (14.38) can be rewritten as 

o> do __odlin(p/A)_ _@ d\n A(r,a) 
“dr or dinr dinr 


Finally, from Eqs. (12.65) and (12.68), show by direct computation that fora > —1 


din A(r,a) _ C(r,a) 
~ dinr | 2B(r,a) 


Eo + (14.43) 


= 2B(r). (14.44) 


Starting from the Jeans equation (13.82) for generic spherically symmetric and 
anisotropic systems, show that the GDSAI in Eq. (14.24) can be rewritten as 


(r) apy = 5 (St at) >0 (14.45) 
acl Saar: dr dr) , 


By integration over the velocity space of Eq. (14.34) using Eqs. (12.25) and (12.27), 
show that Eqs. (14.35) and (14.36) hold. Then, deduce that 


a ce Wr 


p  2q+1’ 
and from Eq. (14.45) prove again that the DGSAT is satisfied for g > —1/2. 


(14.46) 


Appendix 


Mathematical Background 


We summarize here the most important mathematical results used in the text (and useful 
for solving some of the proposed exercises). Overall, this appendix should be intended 
more as a selection of mathematical results relevant to stellar dynamics than an organic 
presentation of definitions and theorems. A good knowledge of linear algebra and calculus 
at the undergraduate level is assumed, and the interested reader is encouraged to refer to 
some of the excellent classical treatises of mathematical physics such as Arfken and Weber 
(2005), Bender and Orszag (1978), Courant and Hilbert (1989), Dennery and Krzywicki 
(1967), Ince (1927), Jeffrey and Jeffrey (1950), Kahn (2004), and Morse and Feshbach 
(1953). To help with further study, in the following sections more specific references are 
sometimes also provided. 


A.1 Identities of Vector Calculus 


The literature on vector calculus and linear algebra is immense. Here, we only mention 
three books that stand out for clarity and rigor, namely Aris (1989), Borisenko and Tarapov 
(1979), and Cullen (1990). In the following, we recall the main identities of vector calculus 
and we restrict ourselves to real vector spaces over ‘t”, whose defining axioms are assumed 
to be known. The inner product obeys the standard rules of commutativity 


(a,b) = (b,a), (A.1) 
linearity 
(a, Ab + pwc) = A(a,b) + (a,c), (A.2) 
where A and py are two real scalars, and finally positive definitness, 1.e., 
(a,a)>0, and (aa)=O <=> a=0. (A.3) 
The norm (the length of a vector) is introduced as 
all = V(aa), (AA) 
and geometrically 


(a,b) = all ||bl| cosé, O<O<7, (A.5) 
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where 6 is the angle between the two vectors, so that the Cauchy—Schwarz inequality 
|(a,b)| < |lal] [|b] (A.6) 


follows. Two vectors with zero inner product are called orthogonal, and a vector with 
unitary norm is called versor. The norm obeys the property of homogeneity 


||Aal] = [A] llall, (A.7) 
to triangular inequality 
la + bl| < [lal] + [Ib], (A.8) 
and finally, if two vectors are orthogonal, the Pythagorean theorem holds 
lial? + (Ib? = lla + Bll’. (A.9) 


By using the norm, it is possible to introduce in a natural way the distance between two 
points by defining d(a,b) = |la — b]l; it is easy to prove that the formal properties of 
distance are in fact verified by this function. 

In linear algebra, it is proved that n orthogonal vectors (versors) in )t”, 


(e;,€;) = dj; (A.10) 


(where 6;; = 1 ifi = j and 0 otherwise is the Kronecker symbol), are a basis for the vector 
space over ‘}t”, so that a generic vector can be expressed as a linear combination of the 
basis as a = aje; with the convention of repeated indices. Note that a; = (a,e;). From the 
linearity of the inner product and orthogonality of the basis, it follows immediately that 


(a,b) = aj b 6; = ajb;, (A.11) 


where b = b;e;; the expression in Eq. (A.11) is also known as a dot product. Moreover, 
for a generic n x n square matrix 7 over St”, with the expression 7a we intend the usual 
row-by-column product of linear algebra, and it is a trivial exercise to show that 


(Ra,b) = (a,R'b), (A.12) 


where R! is the transpose of R (1.e., (R"); j = (R)ji). It follows that if Fe is orthogonal 
(ic. RR! = R'R = I, where J is the identity matrix), then 


(Ra, Rb) = (a,b), (A.13) 


a formalization of the intuitive geometric fact that a rigid rotation of space does not change 
the length and angle between two vectors. The cross (or vector) product — an operation 


! For typographic reasons, vectors are written in the text as row vectors, even though they should be intended as column vectors 
for consistency with the rules of matrix multiplication. The student will also notice that we do not introduce the concept of 
covariant and controvariant bases (e.g., see Narashiman 1993). 
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restricted to vectors of 7 only — is here indicated (with an abuse of notation) with the 
symbol of exterior product / instead of x, so that we write a A b. We recall that 


lla A bl] = |lal] ||b]] siné = Vall Ilbl|? — (a,b)?, O<O<7z, (A.14) 


where a A b is perpendicular to the plane containing a and b and directed so that the 
three vectors a, b, and a /\ b (in this order) are positively oriented. Geometrically, the cross 
product can be visualized as the oriented area of the parallelogram with the two vectors a 
and b as sides. The main properties of the cross product are anticommutativity 


aAb=-—bdAa, (A.15) 
so that a A a = 0, linearity 
ad (Ab+ pc) = AaAD+ padre, (A.16) 
and the Jacobi identity 
ad (bAc)+bA (cAa)+CcA (arb) = 0. (A.17) 


It is a simple exercise to show that in any positively oriented Cartesian reference system, 
where 


Qy A@y =O, O; NC, =Uy, Cy AC, = ey, (A.18) 


the following formal expansion holds: 


ey ey & 0 -az a 
avNb=|aq, @ al= a3 0 —a, |b. (A.19) 
by bo bg —a42— ay 0 


Equation (A.19) reveals an important property of the cross product: its relation with 
antisymmetric matrices in 3°. In practice, the action of a generic antisymmetric” matrix 
R! = —R ona vector x € ‘i? can be also represented as a cross product with a vector 
a € 3° related to the elements of R as in Eq. (A.19), and vice versa. Finally, an important 
property of the cross product — intuitively obvious from its geometric interpretation — is its 
behavior under the action of a proper rotation of matrix 7 (with detR = 1), ie., 


(Ra) A (Rb) = R(ar b). (A.21) 


2 Other families of (real) matrices R of fundamental importance in dynamics are the symmetric matrices R! =R; the 
antisymmetric matrices R! = —R:; the 2n x 2n symplectic matrices RIIR = J, where 


ae a) I Tes. 
re(%, ale Jé=-J, (A.20) 


and / is the n x n identity matrix; the 2n x 2n Hamiltonian matrices (JR)T = JR; and the normal matrices 
[R,R™]=RR'-RIR =0 (see also Chapter 9). 


A.1 Identities of Vector Calculus 299 


With the introduction of the Levi—Civita tensor €;;,, we can write a A b = €;;xe;a jx, 
and recalling that €;jx€kim = 9:15 jm — dim4j1, the student can prove that the triple vecor 
product can be written as 


a (bA c) = bia,c) — c(a,b). (A.22) 


Moreover, the mixed product (a,b Ac) is geometrically interpreted as the (oriented) volume 
of the parallelepiped with sides given by the three vectors, and from Eq. (A.19) it is given by 


aj a a 
(a,b A c) = det(a,b,c) = |b; bo b3]. (A.23) 
Cy C2 3 


From a well-known property of determinants, the vectors can be cyclically rotated in the 
mixed product, so that 


(a,b Ac) = (b,c Aa) = (c,aADb). (A.24) 


A.l.1 Change of Reference Systems 


The mathematical treatment of the change of reference system is of obvious importance in 
physics. Here, we follow the elegant geometric approach adopted by Arnold (1978). 
We consider an inertial system So and a second system 5S’, so that in general 


x= Ro+ Rx, x’ = R'(x— Ro), (A.25) 


where Ro(f) is the instantaneous position of the origin of S’ in So, x is the vector position 
in So, and x’ is the vector position in S’. In general, R(t) is a time-dependent orthogonal 
matrix describing the instantaneous orientation of S’ with respect to Sp, and so RR! = 
R'R = I. The time derivative of this identity shows that R'™R is an antisymmetric 
operator, and from Eq. (A.19) 


0 —W3 W2 
R'R={1 0; O  -a |], (R'R)x =x. (A.26) 
—W2 | 0 


@ is the instantaneous angular velocity of S’ with respect to So, as seen in S’, and it is a 
simple exercise (do it!) to show that the angular velocity of S’ observed in So is given by 


Q = Ro. (A.27) 


We are now in a position to derive the transformation formulae for the velocity in the two- 
coordinate systems: from Eqs. (A.25) and (A.26) 


KX=Vot+R@AxX+4+xX), xX =RI(K—Vo)—@rx’, (A.28) 
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where Vo = Ro is the instantaneous velocity of the origin of S’ as seen from So. The 
transformation of accelerations is obtained from the time derivative of Eq. (A.28). 
% = Ag+ R[X +@Ax'+20AX +@A@Ax)], 


. Te ; ; (A.29) 
x’ = R’(K — Ag) —W@AX’ — 2@0AX’ — WA (@AX’), 


where Ap = Vo is the instantaneous acceleration of S’ with respect to Sg. In the second 
identity in Eq. (A.29), the last three terms on the right-hand side are the Euler’s force, the 
Coriolis’ force, and the centrifugal force, respectively. Of course, in the inertial system So, 
Newton’s Second Law of Dynamics dictates that for a point mass X = g, where g is the the 
sum of all of the forces (per unit mass) acting at x. 

Three comments are in order here. The first is that g’ = R"g is the total (physical) force 
per unit mass expressed in S’ so that, in the particular case of physical forces derived from 
a potential #(x) in So, it follows that 


8 = —R'Vxb(X) xR Rx = —Vx'b'(®), (x) = o(Ro + Rx’), (A.30) 


as can be proved by direct evaluation; notice that ¢’ can depend explicitly on time through 
Ro even if ¢ is time independent. The second comment relates to angular accelerations: it 
is an useful exercise (do it!) to prove from Eq. (A.27) the not-so-obvious transformation 
identity 


@ = Re, (A.31) 


and to show that higher-order time derivatives of the angular velocity in general do not 
obey similar identities. The third and final comment relates to the centrifugal force term 
that from Eqs. (A.14), (A.22), and (A.24) obeys the following easily proved identities: 


Ilo A x’ 
qos 


@ A (@ Ax’) = (@,x')o — lols ‘= —Vx 2 ? (A.32) 


]o A x! |? = |lool|* Ix? — (.x')?, 
so that the centrifugal force can be derived from a centrifugal potential. 

With the aid of Eqs. (A.25), (A.28), and (A.29), we are now in a position to derive 
the transformation formulae for all of the important mechanical quantities of particles and 
systems of particles, such as kinetic energy, momentum, angular momentum, torque, and 
so on. In the following, we will only discuss a couple of applications of great relevance, 
and the student is invited to work out as an instructive exercise the remaining cases and 
then specialize the results for S’ = Scy (i.e., to assume? Ro(t) = Rcm(t)). 


3 See Chapter 6 for the even more special case of pure translation (i.e., Rg = Rcy and R = /). 
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A.1.1.1 Angular Momentum, Torque, and Kinetic Energy Transformation 


From the definition of angular momentum in a generic inertial reference system So 
with reduction center xo for a system of masses m; and total mass M = ; m;, from 
Eqs. (A.21) and (A.22) we obtain the well-known identity 


J = Sox — x0) Amiv; = (Rem — Xo) AMVom + RU + 3'e), (A.33) 


where J’ = }°; x; A mjv;, is the system’s angular momentum in the barycentric reference 
system Scyy and 3’ is the barycentric (discrete) version of the inertia tensor in Eq. (2.89). 
Notice that the last term in Eq. (A.33) can be rewritten as 


Ro = RYR'D = 8Q, (A.34) 


where S is the (in general time-dependent) barycentric inertia tensor of the system as seen 
from So and @ is the angular velocity of Scm as seen from So. A particularly important 
case is obtained when the system of particles is a rigid body and Scm is fixed in it, so that 
J’ = 0 and @ is the rigid body angular velocity; in this case 


= (Rcm — x0) \MVcom + SQ. (A.35) 
Similarly for the torque 


N= U(x) — x0) AFi = (Rem — Xo) A MAcm + RN, (A.36) 
i 
where F; is the total force acting on the particle i, N’ = )7; x, AF’, and F, = R'F;. 
Another important transformation formula is that of the system’s kinetic energy, and 
following the same treatment adopted for the angular momentum we now get 


V ¥/@,@ 
250) “eyo silk +74 V,o)+ Se) (A.37) 


where T’ = )o; IIvj 17/2 is the system’s kinetic energy in Scy and J’ and %’ are the 
quantities introduced above; notice that the identity (9’@,@) = (8@,@) holds. Again, if 
the system of particles is a rigid body and Scy is fixed within it, then T’ = 0 and J’ = 0, 
and one recovers the well-known identity 

Vom? (82,2) 


T=M ; A.38 
oto (A.38) 


A.1.1.2 The Jacobi Integral 


We conclude this discussion with an important result concerning a conserved quantity 
along the orbit of each particle (the so-called Jacobi integral) in potentials rotating with 
uniform angular velocity around some fixed point; the relevance of this conservation law 
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in the modeling of rotating stellar systems should be obvious, even though we will not 
pursue the subject further here (for applications, see, among others, Bertin 2014; Binney 
and Tremaine 2008; Chandrasekhar 1942; Ogorodnikov 1965; Szebehely 1967). With the 
previous assumptions, consider the case of uniform rotation (i.e., @ = 0) of the system 
S’ with a fixed origin Ro = 0, and the motion of a particle under the action of a force 
field derived from a potential @(x) in So, so that the transformed potential $’(x’) does 
not depend explicitly on time.+ Under these assumptions, consider the second identity of 
Eq. (A.29), where the Euler’s force vanishes because @ = 0, the centrifugal force is 
expressed as in Eq. (A.32), and the physical forces are given by Eq. (A.30) 


lo A x’? 
2 
where #; (x’) is known in celestial mechanics as the Roche potential. Following the stan- 
dard treatment, we now multiply both sides of the obtained identity by x’, and we use the 
fact that Coriolis’ forces do not work. It follows that in S’ the Jacobi integral of a test 

particle 


X = —Vy Gp(x') — 20 AX, bh =9'(x) - ; (A.39) 


Cy = T' +mop = E — (JQ) (A.40) 


is conserved. In the first expression in Eq. (A.40), T’ is the kinetic energy of the test mass 
in S’. Remarkably, as is shown by the last expression in Eq. (A.40), the conserved quantity 
Cy can also be written as a combination of the particle energy E = T + m@ and angular 
momentum J in So (where in general these two quantities are not conserved!), as can be 
proved with a little algebraic work by using Eqs. (A.25) and (A.28). Of course, in S’ we can 
introduce again the concept of zero-velocity surfaces (see Chapter 5), which in the present 
case are known as Hill’s surfaces, and conclude that in S’ the motion of the test particle is 
limited to the region(s) 


bp(X’) < ov (A.41) 
m 


for assigned values of the Jacobi constant, while the equilibrium points coincide with the 
critical points of the Roche potential. Therefore, for a given rotating potential (e.g., that 
obtained with the methods described in Chapter 2), it is possible to study the motion of 
tracers (e.g., gas flows) in rotating stellar systems such as the bars at the center of disk 
galaxies. 

We cannot resist mentioning that in the special case of two particles in mutual circular 
orbit, the positions of the resulting five critical points of @p (x’) give the five Lagrange libra- 
tion points of equilibrium (L1,2,3 corresponding to the Euler collinear stationary solution 
and L4,5 corresponding to the Lagrange equilateral stationary solution; see also Chapter 6 
and Figure A.1). We also recall that L1,2,3 are unstable saddle points for dR (x’), while L4.5 


4 Consider, for example, the potential produced by two point masses in circular orbit around their center of mass in the restricted 
three-body problem, or the potential produced by a triaxial ellipsoid rotating around its center of mass with constant angular 
velocity parallel to one of its principal axes. 
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Figure A.1 Hill’s surfaces for the restricted three-body problem for masses m and m on the x’-axis. 


All lengths are normalized to d = x, - a masses to M = m, + mp, and potentials to GM/d; the 
dimensionless mass ratios are m2/M = i and m,/M = 1 — A, so that from Eq. (4.7) x;/d =-i 
and x/,/d = 1 — A, and finally o” = GM/d? from Eq. (4.31). In this figure, we adopted A = 1/10, 
and the normalized values of the Jacobi constant along the zero-velocity Hill’s curves are a = —3, 
b = —1.8,c = —1.7335, d = —1.55, and e = —1.456; the five Lagrange libration proints are 
indicated. Recall that for fixed Cy the motion of a test particle is possible only where dR < Cy/m 


(e.g., see the remarkably clear figure 8.2 in Danby 1962). 


are maximum points for the Roche potential, even though they can be stabilized (for suit- 
able values of the mass ratios) by the effect of Coriolis’ forces. For detailed discussions of 
these fascinating aspects of celestial mechanics, the student is invited to consult Binney 
and Tremaine (2008), and in particular specialized treatises such as Abraham and Mars- 
den (1978), Binney and Tremaine (2008), Boccaletti and Pucacco (1996), Danby (1962), 
Hagihara (1970), Meyer and Hall (1992), Roy (2005), Szebehely (1967), and Valtonen and 
Karttunen (2006). 

We conclude this section by noticing that Eq. (A.40) holds for a single particle in a 
time-independent external potential in the uniformly rotating reference system S’. A natural 
question arises as to whether something similar holds for an N-body system. Of course, a 
basic case is represented by a system simply described as the sum of N particles. In this 
case, the global Jacobi constant is just the sum of the Jacobi constant of each particle, 
where E and J are in general not conserved. A more interesting case is represented by a 
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self-gravitating system in a general state of motion. It is a nice exercise (do it!) to show that 
an identical relation holds when observing the system from a rotating reference system S’ 
with constant angular velocity & = Re; of course, in this case, E and J are conserved. 


A.2 Special Functions 


Here, we summarize the most important properties of the special functions encountered 
in this book. As the literature on the subject is immense, it is impossible even to think 
about presenting a coherent discussion of the subject in a few pages. Among the classical 
references, we may recall Abramowitz and Stegun (1972), Arfken and Weber (2005), 
Erdélyi et al. (1953), Gradshteyn et al. (2007), and Prudnikov et al. (1990). 


A.2.1 Gamma-Related Functions 


Perhaps the most famous and used special function encountered in physics is the Euler 
gamma function, which for real arguments reads 


[o.@) 
ray = f tledt, x>0. (A.42) 
0 


Integration by parts shows that this function generalizes the arithmetic factorial function, 
with the recursive relation 


T(ix+1)=xT(), (A.43) 
and for integers n > 0 


2"+1P(n + 3/2) 


ni=T(n+1), Qny!=2°-Tn4+1), Qn4+D!= EE 


(A.44) 
Special values are 
rd)=f@Q=1, (5) =/n. (A.45) 


Three important relations obeyed by the Gamma function are the duplication formula and 
the reflection? formula 


ray 2 /s+1 ri») (A.46) 
— > —xX =7 a z 
T'(x/2) JI 2 . sin wx 
and finally the Stirling expansion 
TQj<sorx Pe *, x= 00, (A.47) 


5 The Gamma function is not defined for negative integers; notice, however, the important alternative expression of the 
reflection formula that is obtained from Eq. (A.46) for x = 1/2 — y, and the resulting identity in the special case of integer y. 
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The binomial coefficients appearing in the well-known identity 


n 
n n n! 
by" = a =. A.48 
(a+b) > (ie (‘) ki —k)! ee) 
k=0 
can be generalized to real variables by using the Gamma function as 


(*) = rer!) (A.49) 
y) ~ F@-y+bror) 


and in particular Newton’s binomial theorem can be formally written as 


k 2! 


Fe Sa k a(a—1) , 
(+2%= > () jek = l+ozt+— 2 +... lel <b (A.50) 
k=0 


Two functions are associated with the Gamma function and are quite often encountered in 
problems of stellar dynamics: the incomplete left and right Gamma functions 


x [o.@) 
y(a,x) = / t? edt, Tax) = i ae ae (A.51) 
0 x 
Of course, a > 0 for the existence of y(a,x), and in this case y(a,x) + T(a,x) = I'(a). 
They obey the important recursion relations of easy proof 


y(at+1,x) =ay(a,x)—x“e*, T(at1,x) =al(a,x)+x%e™. (A.52) 


The Euler Beta function is defined for real arguments x > 0 and y > 0 as 


1 n/2 
Boy = [ pla alr = | (sin z)**—! (cos z)”—! dz. (A.53) 
0 0 


From the first integral, it is a trivial exercise to show that B(x, y) = B(y,x), while the 
equivalence with the trigonometric integral can be obtained with the change of variable 
t = sin’ z. A not-so-trivial result is the link with the Gamma function 
_ F@To) 

Txt y)’ 


and, as for the Gamma function, an incomplete case is also considered for the Beta function 


B(x, y) (A.54) 


Pi 
B(p.q:x) = / PH — atlas, (A.55) 
0 


where obviously B(p,g;1) = B(p,q), and another common notation is B(p,g;x) = 
B, (p,q). Note that for generic values of x, B(p,q;x) is not symmetric in the exchange 
between p and q; the expansion 

xP d-aq)xitP 


B(p,q;x) ~ —+ , x0, (A.56) 
Pp Lae pe 


is of frequent use. 
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The error function 


2 f* _» y (x?, 1/2) 
Erf(x) = — | e' dt = —___,, (A.57) 
/T JO ft 
is a special case of the right incomplete Gamma function, where Erf(oo) = 1 from 


Eq. (A.45). Strictly related to the error function are the two Dawson functions 


x 


2 [* p 2 _p 
Fi(x) =e e dt, F_(x)=e e dt. (A.58) 
0 0 


Finally, we mention a few other special functions sometimes encountered in stellar 
dynamics: the exponential integral 


[o,@) 
Ex(z) = / t*e-@dt = 2* IT (1 —k,2), (A.59) 
1 


E oF Ex) _ e@*  KEk+1@) — dEx(z) 
KZ) = = ~ 
f=1 Fi Zz Zz dz 


(with the related function Ei) and the Eulerian logarithmic integral, sine integral, cosine 


Ex-1() (A.60) 


integral, and the polylogarithm 
: © cost : ok 
dt, CiZ)=- ——dt,  Lis(z) = ae 
< : k=1 


. < dt . = sint 
Lie) = f ee 
2 Int 0 t 


(with the related functions li, si, and ci). As usual, the polylogarithm is given as a series 
over the unitary disk (see in particular Lewin 1986; Prudnikov et al. 1990). 


A.2.2 Elliptic Integrals and Hypergeometric Functions 


Elliptic integrals of the first, second, and third kinds in the Legendre—Jacobi form are 
given by 


: dv “e dt 
Fig.) = |’ —2__ = | - (462) 
0 JI1-kRsirto Jo JS —2)0 — kt?) 
g sing [|] _ 242 
E(g,k) = i 1—k2 sin? dd = i = de (A.63) 
0 0 1—?? 
dv sing 
rig.n.k) = f = ; 
0 (l—nsin2?d)J1—ksin2?9 Jo (art?) fd — 2) — k22?) 
(A.64) 


where k is the modulus and —1/2 < g < m/2. The complete elliptic integrals are also 
frequently encountered in applications, and they are defined as 


K(k) = F(x/2,k), E(k) = E(t/2,k),  T1(n,k) = M1 (1/2,n, k), (A.65) 
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so that K(O) = E(O) = 2/2, while K(k) ~ — In./1 — k and E(k) ~ 1 fork > 17, and 
dK(k) E(k) K(k) dE(k) E(k) K(k) 
dk — k(1—k?) k°? dk ek k 
Note that the (standard) definition above also leads to real values of the integrals for purely 
imaginary values of k (i.e., when k? < 0). We stress that different definitions may be found 
in the literature (and in computer algebra systems) for the parameters k and n, so special 
care must be exercised in applications. Elliptic integrals satisfy an enormous number of 
quite amazing identities, and the student should be prepared to spend some time with tables 
(e.g., see Byrd and Friedman 1971; Gradshteyn et al. 2007; Prudnikov et al. 1990). 
Elliptic functions were discovered and studied in depth (among others) by Abel (1802-— 
1829), Jacobi (1804-1851), and Weierstrass (1815-1897). They play a fundamental role in 
several area of physics and in particular in mechanics; perhaps the simplest application is 
the motion of a pendulum beyond the harmonic approximation (e.g., see Lichtenberg and 
Lieberman 1992; Meyer and Hall 1992). Here, we simply recall that for a given value of 
the variable u, the angle g obtained by inverting u = F(qg,k) leads to the definition of 
(Jacobian) amplitude, sine-amplitude, cosine-amplitude and 5-amplitude, respectively 


g=amu, sing=snu, cosp=cnu, /1—ksin?g=dnu. (A.67) 


Note that for k = 0 the amplitudes reduce to identity and the elliptic functions reduce to 
trigonometric functions. 

Among the special functions, the Gauss (1777-1855) hypergeometric functions occupy 
a preeminent role. They were introduced as the power series with convergence radius 1 


ab a(a+1)b(b+1) 22 
Fi (a,b;c;z) =1 naan 
2Fy (a,b; c; z) = aor gio 7 


2F, (a,b;c;0) = 0, and of course 9Fj(a,b;c;z) =2 F,(b,a;c;z). One of the most used 
integral representations is 


F(a, b3c;z) = ep fe —1)°-!G — z2t)“"dt, c>b>0. (A.69) 
—— P(b)l(e — b) Jo , 


These functions are solutions of the extremely important second-order differential equation 


(A.66) 


c#0,—1, —2,..., (A.68) 


d? d 
2(l— 2) <4 +[e— (a+b + IzI= — aby =0, (A.70) 
dz dz 


whose regular (fixed) singular points are at z = 0, 1,00 (e.g., see Bender and Orszag 1978; 
Ince 1927; Lomen and Mark 1988). As is well known, a large part of elementary and higher 
transcendental functions can be expressed by using the 2F; functions for special choices of 
the parameters, and an impressive number of identities, transformations, recursion rela- 
tions, and analytical prolongements are knwon (e.g., see chapter 9 in Gradshteyn et al. 
2007). Finally, the 2F; functions are generalized to the , /, functions, and further to the 
Meijer G-functions, and to the even more general family of Fox H-functions (e.g., Mathai 
et al. 2009; Prudnikov et al. 1990). 
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A.2.3 Legendre Functions 


As we have seen in Chapter 2, the Legendre functions arise naturally when separating the 
three-dimensional Laplace operator in spherical coordinates (e.g., see Binney and Tremaine 
2008; Jackson 1998). In particular, the second-order, linear differential equation relative to 
the angle }, known as the Legendre (1752-1833) differential equation, is given in its more 
general form by 


amy ry ae ee ee Wl vam w2ee (A.71) 
sin - =0, <0<T, ; 
sind a0 dd sin? 3 


where A and y are parameters that can assume complex values. Equation (A.71) is also 
encountered in algebraic form, obtained from the substitution x = cos ? 


£ a=] +[sa+4v- we ]v=° —-l<x<l. (A.72) 
dx dx 1— x? 

The literature on the properties of the Legendre equation is immense (e.g., Arfken and 
Weber 2005; Binney and Tremaine 2008; Erdélyi et al. 1953; Gradshteyn et al. 2007; Ince 
1927; Jackson 1998; Lomen and Mark 1988; Prudnikov et al. 1990), so here we only recall 
what is strictly necessary. Being a linear second-order equation, there are two independent 
solutions, the Pp, and OF Legendre associate functions, and these solutions are special 
cases of the Gauss hypergeometric function; in some treatise, Py = P, and ony = Q, are 
simply named Legendre functions. The Fuchs (1833-1902) theorem proves that there are 
two regular singular points atx = +1 (i.e., geometrically) along the z-axis. Moreover, 
the Frobenius (1849-1917) method shows that Qi (+1) are singular independently of the 
values of A and ju, but Be are instead regular at x = +1 for the special case A = / = 


0,1,2,..., non-negative integer, and 4 = m integer, with —/ < m < 1. From now on, 
we restrict our discussion to P/” functions (sometimes named Legendre polynomials, even 
though they are true polynomials only for even values of m, with |m| < /). Remarkably, 
in the separation problem discussed in Chapter 2, the univocity of the azimuthal function 
over 0 < @ < 27 selects 4 = m, and so with the orientation of the z-axis being arbitrary, 
only the P/” functions with |m| < / can be retained in order to describe physical solutions 
over all of the space. Several recursive formulae are known, and here we report the useful 
Rodrigues identity 


md™Pi(x) — (-1)™ ee 
Be 


m _ m 2 1 
P(x) = (-D™( ~ 3°) = m1, (4.73) 
where the first identity holds form = 0,1,2,...,/, and the second also allows us to 
determine the P;” functions form = —/, —1+ 1, —1+2,..., — 1. Note that P/"(x) = 0 


form = 1+ 1,1 + 2,..., and that Eq. (A.73), evaluated for m = 0, gives the Legendre 
polynomials P)(x) = Py (x); moreover, it can be proved that 


(i—m)! 


a era. —l<m<l. (A.74) 


P= = (- 1)" 
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We note here (as it is quite often source of confusion) that ae (x) 4 0 form = —/ — 1, 
—]—2,..., as can be proved from eq. (8.752.2) of Gradshteyn et al. (2007), and iterated 
integral representation extends Eq. (A.73) to values of m < —1. 

It is not suprising that the Legendre functions obey an impressive number of recurrence 
relations, and here we simply recall that 


(21 + 1)xPM (x) = (=m + 1) PI (x) $m + DP" (x), (A.75) 
pr (—x) = (en "Pt (x). (A.76) 
In particular 


0, J+m = odd, 


( ar +m = even, 


and so from Eqs. (A.44)-(A.46) 

(-1/Td4+1/2) _ (-D/@D! 
JVmrd+1) 24)? 

Finally, pr (+1) = 0 form ¥ 0, and 


Py41(0) = 0, Py(0) = Po(0) = 1. (A.78) 


P)(+1) = (£1). (A.79) 


It is a fundamental result that Eq. (A.72) obeys the Sturm—Liouville theorem? at fixed values 
of m, so that P)” at fixed m are a family of orthogonal functions of weight 1, and from 
Eq. (A.73) it can be proved that 


1 

/ Pi" (x) Pr (x)dx = ee aisle Ll! > |ml, (A.82) 
-] 21+1(—m)! 

so that P)” can be easily normalized to an orthonormal set. Some additional work proves 

that, at fixed m, the Pr" (x) are a complete set of functions with respect to / > |m|, and so 

these functions can be used to expand a function f(x) over the interval —1 < x < l,ora 

function f (%) over the interval 0 < 3 < 7, ina Legendre series. This property will be used 


6 We recall that a linear second-order ordinary differential equation is in the form of a Sturm—Liouville problem when it is 
given as 


d 


dx 


dy 
[rw | + q(x)y = Aw(x) y, (A.80) 
dx 


where w is the so-called weight and i is a parameter. Here, we restrict ourselves to the real Sturm—Liouville problem. We 
consider two values i; and Az and two solutions y; and y9 over the interval (a,b). By multiplication of the equation for y; 
with the solution y2 and integration by parts, interchanging the index in the resulting epression, and subtracting the two 
identities, one arrives at 


b 
(4, — Aa) [ yi@)y2@)w@dx = parr} — yiy5)12, (A81) 
a 


so that if p vanishes at the integration interval and/or the Wronskian vanishes, the two solutions corresponding to different 
values of A are orthogonal with weight w. 
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in Chapter 2 to construct, jointly with a Fourier series, the family of spherical harmonics, 
a fundamental tool in potential theory. Finally, again following the discussion in Chapter 2, 
the generating function of Legendre polynomials can be obtained by equating the Green 
function for a mass point on the z-axis with the multipole expansion for the same problem, 
proving that 


Ss -> Pi(x) ¢!. (A.83) 


An analogous formula holds when expanding the function (1 — 2xt +t arising in the 
study of power-law forces, and when the Legendre polynomials are substituted by the more 


2) —a1/2 


general Gegenbauer polynomials CY (x) (sometimes known as ultraspherical polynomials; 
e.g., see Hernquist and Ostriker 1992 for an application to potential theory). 


A.2.4 Bessel Functions and Hankel Transforms 


The Bessel functions arises naturally when separating the three-dimensional Laplace oper- 
ator in cylindrical coordinates (e.g., see Binney and Tremaine 2008; Jackson 1998), as 
discussed in Chapter 2. The second-order, linear differential equation relative to radius R, 
known as the Bessel (1784—1846) differential equation, is given in its more general form by 


ld dU Va 
———= | R—— kr —-—)uU=0, A.84 
nal a) +{ “) cae) 


where k and jz are parameters that can assume complex values. A remarkable property 
holds for Eq. (A.84): with the introduction of a scaled radius x = kR, Eq. (A.84) becomes 


12 (sZ)+(1-S)u=o O<x <0, (A.85) 
x dx dx x? 

In particular, once Eq. (A.85) is solved in terms of U,,(x), it is possibile to obtain 
immediately the solution of Eq. (A.84) as U,(kR), so that the parameter k is actually 
contained in the argument of the function. As for the Legendre functions, the literature 
on Bessel functions is immense (e.g., Arfken and Weber 2005; Binney and Tremaine 
2008; Erdélyi et al. 1953; Gradshteyn et al. 2007; Ince 1927; Jackson 1998; Lomen and 
Mark 1988; Prudnikov et al. 1990; Watson 1966), and great attention should be given 
when consulting specific treatises and tables, because Bessel functions are often known by 
different names/symbols in different contexts. Here, we adopt the most used nomenclature 
for the two independent solutions of Eq. (A.85), and we indicate with J,, and Y,, the Bessel 
functions of the first and second types; needless to say, Bessel functions are special cases 
of the Gauss hypergeometric functions, and so they obey an impressive number of relevant 
identities. Concerning our problem originated by separation of variables for the Laplacian 
in cylindrical coordinates, the Fuchs and Frobenius theorems show that only J,, functions 
are well behaved near the origin, and this is only for 1 = m integer. Restricting ourselves to 
this family of functions, here we simply recall a few fundamental identities of frequent use 
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2m ef 
Jin—10%) + Imi (x) = —Im (x), Jem) = (-1)"JSin@&), (A.86) 


and J,,(0) = 0 for integer m 4 0, while Jo(0) = 1. Moreover 


Din (X) -_ Jm—1(%) — Im4i1(%) dJo(x) 
dx 2 eae = 


where the last identity is easily proved from the general one. Finally, by using the Sturm— 
Liouvelle theorem, a comparison of Eqs. (A.80) and (A.84) shows that the functions 
Jm(KR) at fixed m are orthogonal with weight R for different k, and in fact the following 
Hankel closure relation can be proved: 


Ji), (A.87) 


oa) (A.88) 
a 


[ RJm(aR)Jm(bR)dR = 
0 


The case where a ¥ b is simply a consequence of orthogonality, but the case where a = b is 
not trivial, and it is not proved here. The closure relation allows us to introduce an important 
family of integral transforms, the so-called Hankel transforms, based on Bessel functions. 
In fact, under the assumptions that the closure relation holds and that all of the necessary 
convergence requirements are satisfied, the student is invited to “prove” by inverting the 
order of integration that the perfectly symmetric pairs of identities hold at fixed integer m 


fa) = | kIm (kx) fn (kak, fink) = | XS (kx) f (x)dx. (A.89) 
0 0 


We conclude this section by recalling the family of modified Bessel functions of the first 
(Im) and second (K,,) kinds, which can be obtained from the Bessel functions evaluated 
for purely imaginary arguments (e.g., see Arfken and Weber 2005) or as solutions of the 
modified Bessel differential equation 


1d (dU 2 
x ie SG tw Sc, (A.90) 
x dx dx x? 


Some of the most relevant and used identities are the following: 


2m 
Tn—1 (x) — Tn41(X) = — Im (x), Iem(x) = In(@®), (A.91) 
and I, (0) = 0 for integer m 4 0, while Ip(0) = 1. Moreover 
Am (x) -_ Tin—1(%) + In 41 (x) dIo(x) = 


at : oS = hi). (A.92) 
Finally 
2m 
Kale Ka Sa eas Boat ha) re 
dKn(X) Km) + Kms) dKol®) gy, (A.94) 


dx 2; , dx 
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A.2.5 Dirac 5-Distribution 


As is well known, the so-called Dirac 6-function is not a function in a technical sense, 
but a distribution. The approach in this section is necessarily “phenomenological,” but 
the student is reassured that the theory of distributions is rigorously founded (e.g., see 
Vladimirov 1979), and all of the results presented here can be formally proved. In 3”, by 
using Cartesian coordinates, and for any sufficiently well-behaved function f(x), the 6 can 
be introduced operationally as a mathematical object such that, for an open set Q and a 
point xo 


0, xo ¢ Q, 
. f(%)8(K — X0)d"x = (A.95) 

Q Ff &o), Xo € Q. 
Note that 6(x — xo) = 6(xo — x). The meaning of the Dirac 5-function when xo belongs to 
the frontier of 2 would involve some discussion that we avoid here; it is sufficient to recall 
that in some case (but not always) the integral in Eq. (A.95) can be shown to evaluate to 
F (Xo) /2 when xg € 0&2. Three immediate consequences derive from Eq. (A.95): the first is 
obtained by considering f = 1, with the formal result being that the value of the integral of 
the 5-function equals’ 1 if x9 € Q (independently of the specific position of xq), while 
it is 0 for external points. The second fact is that when used in physical applications, the 
6-function is not a dimensionless object, but has units of the inverse of the n-dimensional 
volume. Third, if we pretend that the Dirac 5-function obeys the rules for multiple integrals, 

it is reasonable to expect (and in fact it can be proved) that in Cartesian coordinates 


n 
5(x — xo) = | [ai — x0) (A.96) 
i=1 

(i.e., the multidimensional 6-function can be factorized as the product of a one-dimensional 
6-function, one for each coordinate). When working with the 5-function in curvilinear 
coordinates, which is a common situation in physics (see Appendix A.8), some care is 
needed. In fact, it is quite obvious that the defining property of the 6-function in Eq. (A.95) 
must be independent of the specific coordinates adopted to evaluate the integral, and so the 
transformation rule of the 6-function from Cartesian to curvilinear coordinates is 


[jai 8a — Goi) 


ial 


n 
, [Fol =| [a. (A.97) 
i=l 
where J(q) is the Jacobian of the coordinate transformation and qo are the curvilinear 
coordinates of the regular point xo (i.e., where J 4 0). The last expression in Eq. (A.97) 
holds for orthogonal curvilinear coordinates, and the Lamé coefficients h; are given in 
Eq. (A.140). Notice that in Eq. (A.97), the J can indifferently depend on q or qo, because 


7 Notice that this property clearly shows that the 5-function is not — as sometimes stated — a function equals to 0 everywhere 
except for a point x9 where its value is 00; the volume integral of such an object would be 0 even for xg € Q. 
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when performing integrations J is in any case evaluated at qo. In particular, in the one- 
dimensional case 


d(x — xo) 
lg’(xo)| 


é[g(x) — y] = 8 (x0) = y, (A.98) 


when xo is not a critical point for g. 
A function strictly related to the 5-function is the Heaviside step function 


oxy |b *2% (A.99) 
x= A 
0, x <0, 


where the value for argument 0 is somewhat a matter of definition. For the student, it is a 
nice exercise to show that, if allowed to perform standard integration by parts, then 


[ d0(x —a) 
— Fide (A.100) 
A dx 


would produce for values of a inside/outside the integration interval, the same result 

obtained by using the 6-function; in other words, we could formally use 
d(x — a) 

= dx 

This result is at the basis of one of the most useful methods for building the Green function 

for ordinary differential equations (ODEs; e.g., Bender and Orszag 1978; Jackson 1998; 

Lomen and Mark 1988; see also Chapter 2 and Appendix A.7). 


d(x — a) 


(A.101) 


A.3 Fourier Transforms and Series 


The Fourier (1768-1830) direct and inverse transforms of a well-behaved function f (x) 
defined over St” can be written as 


—i(k,x) i(K,x) 


e 
(27)? 


é 


( 
(27)? 


fay= | roo—Za'x, f= / fds) a", (A.102) 
an gan 
where k € {t” is the so-called wave vector. Fourier trasforms are some of the most powerful 
tools of mathematical physics, and the literature dedicated to them is immense; a very clear 
introduction to the subject can be found in Arfken and Weber (2005), where other important 
integral transforms (e.g., the Hankel, Laplace and Mellin transforms, often encountered 
in stellar dynamics) are also presented. Here, we simply recall that the kernel function 
appearing in Eq. (A.102) is the spatial part of a plane wave e!“&*)—@) propagating perpen- 
dicularly to k, with wavelength 7 = 2zr/||k||, period T = 27/a, and phase velocity vg = 
A/T = w/||k\|. The fact that these functions are eigenfunctions of the Laplace operator 
in Cartesian coordinates (see Chapter 2 regarding the construction of the Green function 
for the Laplace operator) is of fundamental importance not only for classical physics, but 
also in quantum mechanics, where (for example) the kinetic energy of a particle is repre- 
sented in the nonrelativistic Schroedinger equation as the Laplacian of the wave function 


314 Mathematical Background 


(for an illuminating discussion, see Messiah 1967). Mathematically, it can be proved that 
the Dirac 5-function in Eq. (A.96) can be Fourier transformed, and from a formal integra- 
tion we have the beautiful integral representation 


e7 1 \k, x0) 


800 = Goa 


6(x — x0) = si i; : gi he m> ang, (A.103) 
The student is invited to meditate on the amazing fact that the Dirac 5-function, admittedly 
the mathematical object nearest to the concept of a “point mass,” can also be seen as the 
sum of infinitely extended waves! 

Fourier series are mathematical objects that are strictly related to Fourier transforms. 
In the one-dimensional case, for any well-behaved function f(g) defined over the interval 
0 < » < 2r, it is possible to show that, in analogy with Eq. (A.102) 


eo eli . Qn ea img 
fo= Do ine n= fo [OF 


In practice, this fundamental result tells us that a “generic” function, defined over a finite 
interval, can be expressed by adding a sufficient number of periodic functions weighted 
with suitable coefficients; we recall that this family of functions arises naturally when 
separating variables (e.g., spherical and cylindrical) in order to determine solutions for 


dg. (A.104) 


the Laplace equation, and requiring uniqueness after a whole rotation of 27 around a given 
direction. In particular, it is possible to expand the Dirac 6-function as follows: 


[ee 


1 : 
= §1O=t)=  eee (A.105) 
2m 2x m>=—OO 


The mathematical and physical literature on Fourier series and their properties is enormous, 
and again the student is invited to consult, for example, Arfken and Weber (2005). 
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A.4 The Gauss and Stokes Theorems 


Several excellent presentations of differential calculus in curvilinear coordinates exist. At 
the level of undergraduate and graduate students, remarkably clear books are Do Carmo 
(1976) and Edwards (1994), and for the more physically oriented readers, good references 
are Arfken and Weber (2005), Aris (1989), Becker (1982), Borisenko and Tarapov (1979), 
and Narashiman (1993). In any case, a reading of the beautiful presentation of the subject 
in volume 2 of Feynman et al. (1977) is highly recommended. 

Two of the most far-reaching results of vector analysis are the Gauss (divergence) and 
Stokes (circuitation) theorems. These basic results are reported here in Cartesian coordi- 
nates and in Appendix A.8 in curvilinear (orthogonal) coordinates. We first introduce the 
concepts of gradient, circuitation, and flux. 

Let f (x) be a well-behaved function from ‘t” to t, so that (loosely speaking) the identity 
Ff (x) = c determines, for a given value of c, ann — 1 dimensional “surface” (variety) in 3t”. 
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Moreover, let x(A) be a regular curve on the surface (i.e., f[x(A)] = c, with x(0) = xo). 
Then, in Cartesian coordinates we can introduce the mathematical object grad f from the 
obvious identity 


df dx 
0O= — = (grad f, — (A.106) 
dd |,=0 dx] \,— 
where adopting the convention of summing over repeated indices 
0 
grad f =Vf, V=e— (A.107) 


OX; 
Being the vector dx/da for A = 0 tangent to the surface at x = xo, it follows that grad f is 
perpendicular to the surface at xo. 

A vector field that can be derived from the gradient of a scalar function is called exact, 
and the scalar function is called a potential. The importance of exact fields in physics is 
immense because they describe, among other things, conservative force fields (in the time- 
independent case®), and irrotational fluids in simply connected regions. 

Now let g(x) be a well-behaved vector field from {%” to 3”, and let y (A) be some regular 
closed curve in )t” with a < 4 < bso that x(a) = x(b). The circuitation of g along y is 
naturally defined as 


b dx 
cer) = | (eas) =) (2). (A.109) 
y a 


It follows that C(g, y) = 0 for arbitrary closed curves if g is exact, and in turn this means 
that the value of a line integral of g between two arbitrary points is actually independent of 
the path adopted to connect the two points. As is well known, physically this means that 
the work (the line integral) done by a conservative (exact) force field g depends only on the 
initial and final positions. Moreover, it can be proved that a field defined over a connected 
region of }t” (i.e., qualitatively speaking, a region of space so that two arbitrary points can 
be connected with a continuous path) is exact if and only if the circuitation over all closed 
curves is zero. Note that this important result does not require for the region to be simply 
connected (i.e., that all of the closed curves can be continuously shrinked to a point). 

Finally, the flux of the well-behaved vector field g(x) over some regular n — 1 dimen- 
sional surface of $i” is defined as 


Fg,x)= / (g,n)d"~'x, (A.110) 
x 


8 Some care is needed here. In fact, notice that a force field could be exact but time dependent (i.¢., it could be that 
g = —V¢(x,t)). The most famous case is the restricted three-body problem expressed in the barycentric inertial (i.e., 
nonrotating) system: the force field produced by the two massive bodies is exact, but it is not conservative, and along the 
orbits of the test particle (of vanishingly small mass m) 
dE ag 


E=T+m@¢ ii m at (A.108) 
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where n is the unitary normal to ¥ at x. In case of Sh, the surface integral can be calculated 
by considering the parametrization x = x(u,v), and the normal and area of the surface 
element are given by 


ox >: Ox ; ; 

Pa ax 8 aj aj ax 9 
noose Pea |S KS | aa Sa |S) 2 Se aed. 
Ox Pees du Ov ou dv du Ov 

du ov 


(A.111) 


where the last identity derives from Eqs. (A.5) and (A.14). 

The Gauss (or divergence) theorem states that for any well-behaved vector field g in {t” 
and any well-behaved region of space Q bounded by the n — | dimensional surface 0Q with 
outward normal n at position x, the following striking identity holds: 


[ emanix= f divears (A.112) 


In other words, the integral flux of a vector field equals the volume integral of a mathemat- 
ical object called the divergence of the field. From the identity in Eq. (A.112), we obtain 
the coordinate-free expression of the divergence operator 


1 
div g = lim —— n)d"~!x, A.113 
ve> Ue VQ [em x ( ) 


where V ({2) is the volume of the region Q and the limit must exist independently of how 
Q —> 0 at x. In particular, in Cartesian coordinates it can be proved that 
divg = agi = (V,g), (A.114) 
OX; 
where the last notation derives from the formal identification of the operator V in 
Eq. (A.107) with a vector (the student should be aware that such an identification, if 
applied mechanically, can produce terribly wrong results!). A vector field so that div g = 0 
at all points in its definition (open) domain is called solenoidal. 

Similarly to the Gauss theorem, the Stokes (or circuitation) theorem says that for any 
well-behaved vector field in 3i? and any well-behaved (simple) closed curve 0© with a 
boundary of an (orientable) open surface &, the identity 


/ (g,dx) = / (rot g,n)d7x (A.115) 
CD») x 


holds. In other words, the circuitation of a vector field over a closed curve equals the flux of 
a mathematical object called the rotor (or curl) of the vector field, evaluated over any surface 
with a boundary given by the closed curve. Recall that in the integral in Eq. (A.115), the 
integration along 0» is in such a sense that it is “seen counterclockwise” when “observed 
along” n. In analogy with the divergence case, the component of rot g along the direction n 
at point x can be obtained by considering 
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(rotg,n) = lim 


50 S(d) en —— 


provided the limit exists independently of how the surface element & with normal n and 
area $() shrinks to 0 at x, and in Cartesian coordinates 


a 
rot g = €;jKe) =VAg. (A.117) 
OXK 


A vector field so that rot g = 0 at all points in its definition (open) domain is called closed 
(or irrotational). A simple application of the Stokes theorem is the deduction of the Green 
theorem for functions in i? simply by considering a vector field g = (g,(x, y), gy (x, y),0) 
and a regular closed curve in the plane z = 0. 

The integral representation of the operators div and rot is called coordinate-free because 
the operators at the right-hand side are expressed in terms of integral relations, which are 
of course independent of the specific coordinates adopted. This property is of great impor- 
tance not only because it allows us to obtain the expressions of the important differential 
operators in curvilinear coordinates (see Appendix A.8), but also because it makes apparent 
that an operator is independent of its representation, a concept that should be stressed to 
students. 

From Eqs. (A.107), (A.114), and (A.117), it is a useful exercise to show by direct 
evaluation that 


rot (grad f) = 0, div (rot g) = 0, (A.118) 


but the student is encouraged to reach the same conclusions by using the coordinate- 
free expressions in Eqs. (A.113) and (A.116). The first (i.e., the fact that an exact field 
is closed) can be obtained by considering the circuitation around arbitrary small surface 
areas containing the point of interest. The second is from the divergence theorem applied 
to rotg over a small volume whose bounding surface is ideally represented by joining 
two parts 02; and 022 separated by the same closed curve y. From the Stokes theorem, 
the total flux is 0, and by shrinking the volume to 0 the statement is proved. Here, we 
recall that the identities in Eq. (A.118) and the Gauss and Stokes theorems are just special 
cases of more general results obtained in the framework of differential forms.? A vector 
field is called solenoidal if div g over some region of space, closed (or irrotational) if 
rotg = 0, and finally exact if it can be derived as the gradient of a scalar function (the 
potential). This nomenclature reflects the fundamental importance of these concepts in fluid 
dynamics, successively extended to Hamiltonian dynamics in phase space and to classical 
electrodynamics. The two results above are remarkable in the sense that they connect local 
and global properties. In particular, the Stokes theorem produces important consequences 
when applied to closed fields in multiply connected regions; the student should deduce that 
circuitation along paths including the nonconnected regions of a closed field leads to results 


9 A differential k-form is called exact if it is the external differential 4 of a (k — 1) form and closed if its external differential is 0. 
A theorem from Poincaré states that an exact form f is necessarily closed (i.e., d0f = 0; e.g., Arnold 1978; Edwards 1994). 
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that are independent of the specific closed path adopted for the integration. The situation is 
analogous to the case of the residue theorem in complex analysis. 

A fundamental second-order linear operator (elliptic) is the Laplacian of a scalar func- 
tion 


af 


—, A.119 
m2 (A119) 


A = div grad f = > 
which is of central importance in physics, where the last expression holds in Cartesian 
coordinates. From the divergence theorem, we have 


/ Af d"x= / (n, grad f)d"~!x = / aT gly. (A.120) 
Q aQ aq on 
where the last expression introduces the so-called normal derivative of the function f along 
the direction n, with df/dn = (V f,n). From this expression, we can obtain the Laplacian 
as a limit in analogy with Eq. (A.113). A scalar field so that Af = 0 at all points in its 
(open) definition domain is called harmonic. Therefore, a vector field that is both solenoidal 
and exact can be expressed as the gradient of a harmonic function. 

The divergence theorem can also be used to obtain a very useful and remarkable 
identity as follows: let us consider the Gauss theorem in i” for the special field g = 
(0,...,8,.--,0), where only the i-th component is not 0. Therefore 


F 
[ arx= | gnjd"—!x. (A.121) 
Q OX; IQ 


By multiplication with the basis versors e; and with a suitable choice of function g, we can 
obtain several relevant coordinate-free identities that allow us to derive operators in generic 
coordinate systems from volume integration. In particular 


[ena patx= [ nfd"—'x, [ rosa'x= | nA gd°x, (A.122) 
Q dQ Q dQ 


and so the corresponding operators can be defined in general coordinate systems by con- 
sidering the limit of previous identities as in Eq. (A.113). We recall the alternative way to 
derive Eq. (A.122): applying the divergence theorem to the two vector fields af anda A g, 
where a is a constant but otherwise arbitrary vector. 

We conclude this section by mentioning that there exist several beautiful identities that 
can be constructed by using combinations of the differential operators introduced above 
(e.g., see Arfken and Weber 2005; Aris 1989; Jackson 1998), and the student should spend 
some time practicing and on inventing problems. Here, we simply recall 


rot (rot g) = grad (div g) — Ag, (A.123) 
which is of fundamental importance in electromagnetism, and 


grad (a,b) = (a, V)b + (b, V)a+aA rotb+b A rota, (A.124) 
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which is especially relevant for its use in fluid dynamics, when the two fields are taken to 
coincide with the fluid velocity u, allowing us to rewrite in a convenient form the advective 
part of the material derivative (u, V)u, and finally 


rot (aA b) = adivb — bdiva — (a, V)b + (b, V)a. (A.125) 


A.5 Green Identities 


A fundamental identity, which we used to prove the remarkable unicity theorem for the 
Poisson equation in Chapter 2, is obtained very easily from the Gauss theorem. We start by 
considering the special vector field 


u= fVg, (A.126) 


where f and g are two generic, well-behaved scalar functions from {t” to it of argument x. 
Taking the divergence of u and using the Gauss theorem, the first Green (1793-1841) 
identity follows: 


[rast wrvenax= f 8 ary (A.127) 
Q aQ on 


where 02 is the surface bounding the arbitrary volume Q and n is the outward normal (unit) 
vector to 0&2. The second Green identity can be obtained from the first identity simply by 
exchanging f and g in Eq. (A.127), and subtracting 
0 0 
/ (fAg — gAf)d"x = / p28 — pF) gry, (A.128) 
Q aQ on on 

A third Green identity also holds, but it is not used here, and so it is not reported (e.g., see 
Jackson 1998). 


A.6 The Helmholtz Decomposition Theorem 


The following result (which we present in 9t* but can be generalized in n dimensions in 
the framework of differential forms) explains why in physics we are so interested in the 
behavior of vector fields under the actions of operator divergence and rotor (just think 
about the Maxwell equations!). In fact, the Helmholtz (1821-1894) decomposition theorem 
(e.g., see Arfken and Weber 2005) proves that a generic vector field defined over a simplly 
connected region of 9? is fully (in the sense explained below) specified when its divergence 
and its rotor are known. For example, in fluid dynamics, this would be related to the 
knowledge of sources/sinks and vortices, and in electromagnetism to the specification of 
charges and currents. Technically, the theorem says that all well-behaved vector fields in 
x? with the same divergence and the same rotor are essentially the same, with the freedom 
(gauge) to add the gradient of a generic harmonic function. A proof is as follows: suppose 
we have a field g and we search for the most general form of fields g,. with the same rotor. 
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From the linearity of rot, the difference of the two fields is closed, and from the assumption 
that the domain is simply connected, it is also exact, i.e., 


rot(g, -g)=0 => g,=g+gradf (A.129) 


for generic functions f. In practice, all of the fields with the same rotor as g are given in 
Eq. (A.129). We now require that the two fields have the same divergence, so that 


0 =div(g, -—g) =Af, (A.130) 


so that f is harmonic, and this proves the theorem. Of course, we could start by searching 
for all fields g,. with the same divergence of g and, following a similar approach, conclude 
that now g, = g+ roth, for arbitrary functions h (the so-called vector potential). We now 
ask for the same rotor, and we obtain rot (roth) = 0 over a simply connected region, so 
that roth = grad f for some f, so that again we conclude that f is a harmonic function. 

Now, the Green theorem of unicity allows us to restrict the Helmholtz theorem by 
proving that, in addition to divergence and curl, the field g2 is also required to coincide 
with g; on the boundary of a volume, then g2 = g; (i.e., the field is uniquely determined). 
In fact, with this additional requirement, we have that inside the volume Ag = 0 and on the 
boundary grad ¢@ = 0. But if grad gd = 0, then d¢/dn = 0. We now apply the first Green 
identity with f = g = @¢, so that we have a Neumann problem. Following a line analogous 
to the discussion in Chapter 2, Section 2.1.1, it follows immediately that grad @ = 0 inside 
the volume, and this proves the theorem. 

Note that the unicity result just proved allows us to decompose a given vector field in 
a unique way. In fact, consider the field g and compute div g = p and rotg = w. Now, it 
follows that the field g can be written in a unique way as the sum of a solenoidal g, and an 
irrotational g; field. In fact, let g = g, +;, with div g, = 0 and rot g; = 0; we immediately 
see that divg; = p and rotg, = @, so that the two fields have specified divergence and 
rotor, and from the previous result they are uniquely determined. It is possible to show how 
to construct g explicitly by using some suitable gauge condition in Eq. (A.123), but we will 
not discuss this point here (e.g., see Arfken and Weber 2005; Becker 1982; Jackson 1998). 

We conclude by recalling that the Helmholtz decomposition is not the only possible (and 
remarkable) way to decompose a vector field. Other decompositions can be constructed 
based on the concept of complex lamellar and Beltrami fields (e.g., see Aris 1989), defined, 
respectively, as fields perpendicular and parallel to their rotor, i.e., 


(g,rotg)=0,  gA(rotg) =0. (A.131) 


Among several others applications, this characterization plays a fundamental role in the 
theory of hydrostatic equilibria (e.g., see Tassoul 1978). 


A.7 Green Functions 


In Chapter 2, we encountered the problem of the construction of the Green function for the 
Laplacian. Here, we provide general information on this fundamental mathematical tool 
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(e.g., see Arfken and Weber 2005; Bender and Orszag 1978; Jackson 1998; Lomen and 
Mark 1988). Suppose we look for the solution of the problem 


D[ fl = A), (A.132) 


where D is some linear differential operator,!° h(x) is a given function, and for simplicity 
we restrict ourselves to Cartesian coordinates in t”. We proceed formally, without entering 
the important questions of existence, regularity, etc., for a solution f, which we assume 
exists and has all of the properties needed for the following discussion. In the framework 
of the Green functions, the idea is not to solve directly Eq. (A.132), but to solve once and 
for all a problem so that the solutions for different choices of the source function h do not 
require us to solve different equations, but can be obtained once a Green function for the 
operator is known. A function G(x, y) will be called a Green function for the operator D if 


D[G] = 6(x—y). (A.133) 


If we know G, than we have solved all of the problems in Eq. (A.132)! In fact, consider 
(formally) 


f@= / z G(x,y)h(y) d"y, (A.134) 


and apply the operator D to it. Exchange the operator with the integral (intuitively, this can 
be done because D is linear and integrals are just sums) and use Eq. (A.95); this completes 
the “proof.” In practice, the function G describes the “reaction” of the operator to a very 
special source term, and the idea is that the solution of the general problem (thanks to the 
linearity of D) can be obtained as the weighted superposition of these special solutions. 
How can we construct the function G for a given D? A consideration is in order: intuitively, 
we can expect that the Green function is strictly related to the solution of the homogeneous 
case of Eq. (A.132) — after all, the 5-function in Eq. (A.133) is equal to 0 for all points of 
space x  y! The idea is to combine the general homogeneous solutions Go in a clever way 
so as to obtain the peculiar behavior of the 6-function at x = y. 
We first consider the case of a one-dimensional operator (e.g., see Eq. (6.12)) 


DUF] = fF) + ane) fe) + Fa) f () + age) fO@), (A135) 


where aj(x) are regular functions and with f“) we indicate the k-th derivative; as 
usual f© = f. Suppose we look for the solution of Eq. (A.133) for our n-th-order, 
one-dimensional differential operator, and we determined the general solution of the 
homegeneous problem D[ fo] = 0 


fo = - Ai foi(), (A.136) 


i=l 


10 We recall that for linear operators D[au + Bv] = aD[u] + BD{v]. 
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where A; are n arbitrary constants and fo; (x) are the n linearly independent solutions. The 
idea is to consider a different set of constants for x < y and x > y and to obtain the 
sought-after 5-function behavior at x = y by using Eq. (A.101). If we can fix the values of 
the constants in the two separated regions of space so that ic 
6-function, then f will behave as the Dirac 5-function, and the one-dimensional problem 
is solved because all of the lower-order derivatives remain finite and do not affect the 
behavior of the solution at the critical point x = y. The choice of the constants is now 
trivial: if the n — 1-th derivative is the 0-function, than the lower-order derivatives are 
continuous at x = y (integration improves regularity by unity!). In practice, the constants 
are fixed by requiring that for 


" behaves as the Heaviside 


Pater) = Sorin)» — Solter O) + 1 = Foright > (A.137) 
where the first identities must hold for k = 0,...,n — 2. Notice that if we look for the 
solution of the Green problem D[G] = Ad(x — y), the corresponding function is obtained 
simply by requiring a “jump” of A instead of unity. We can now discuss the full problem 
in Eq. (A.133): clearly, we now face a new geometric difficulty with respect to the one- 
dimensional case. In fact, while in the one-dimensional case the singularity of the right- 
hand side member is geometrically controlled by the fact that the “jump” can only happen 
along the x-coordinate, in more than one dimension, the jump could happen along arbitrary 
directions. How we can control this? The solution is provided by the method of separation 
of variables. In practice, we search for a homogeneous solution of Eq. (A.133) by using 
the method of the separation of variables. In many cases of interest (see Chapter 2), we 
obtain (complete) sets of orthogonal functions. Suppose that the separation of variables for 
our problem produced n — 1 families of orthogonal functions: the idea is to expand the 
(unknown) Green function G on these bases, evaluate D[G] for such an expansion, isolate 
the remaining one-dimensional Green problem, and finally solve it by using the approach 
described above. Intuitively, the expansion of the orthgonal functions allow us to “control” 
the jump along the “direction” of the remaining variable. As the student can see in the 
three explicit examples for the Laplacian in Chapter 2, in practical cases the method is less 
complicated than its description! 


A.8 Differential Operators in Orthogonal Curvilinear Coordinates 


We recall here the most important properties of (orthogonal) curvilinear coordinates; for 
simplicity, we will not discuss the strictly related and important subject of covariant and 
contravariant bases, and we will further restict ourselves to 3. Very clear and rigorous 
treatments can be found in Arfken and Weber (2005), Do Carmo (1976), and Narashiman 
(1993), and for the more physically oriented readers, useful references are Becker (1982), 
Lamb (1945), and Milne-Thomson (1996). 

We start by considering a system of curvilinear coordinates g;, with i = 1,2,3. With 
the compact notation q = (q1,q2,q3) we do not indicate a vector, but only a triplet of 
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curvilinear coordinates.!! Once the q coordinates are assigned, the relation between Carte- 
sian and curvilinear coordinates is known 


xX = x(q). (A.138) 


A simple geometric interpretation of coordinates in terms of the intersection of surfaces 
of constant g; is extremely useful for the following discussion, and here we simply recall 
that Cartesian coordinates can be interpreted as the intersection of mutually perpendicular 
planes, spherical coordinates of spheres, cones, and planes, and cylindrical coordinates of 
cylinders and two planes. 

It follows immediately that by fixing the values of two of the three curvilinear coordi- 
nates in Eq. (A.138) and considering the partial derivative of the resulting curve described 
by x as a function of the remaining g; coordinate, we determine the tangent to the coordi- 
nate curve, which we call 


ig ay OS (A.139) 


Ogi 
Curvilinear coordinates such that (x;,x;) = 0 fori # j are called orthogonal curvilin- 


ear coordinates, and in the following we restrict ourselves to this special but extremely 
important family. Of course, the /ength of the tangent 


hi(q) = |Ixill (A.140) 


in general changes along the coordinate curve, and the functions h; are known as the 
Lamé (1795-1870) coefficients of the coordinate system. It is natural to define a local 
orthonormal basis as 


fata t— i=1,2,3, (A.141) 


where no summation over the index 7 is intended. Without loss of generality, we can 
assume that the triplet q is ordered so that the associated f; form a positively oriented 
local basis, i.e., 


(f;,f;) = bij; f; A fp = fs, (A.142) 


where the indices in the last identity can be cyclically permuted. 
In Cartesian coordinates, x = xe, + yey + zez, and so hy = hy = h, = 1. In spherical 
coordinates (r,7,@), we have x = (r sin? cos g,r sin? sin g,r cos #), so that 
hy=1, hy =r, hg =rsind, 
f,, = (sin? cos g, sind sing, cos #), 
r= ( g p ) (A.143) 
fy = (cos 0 cos, cos 0 sing, — sin?), 


fp = (— sing, cos g,0), 


11 Tt should be unnecessary to remind the reader that not all ordered lists of numbers are vectors, but only those obeying the 
axioms of vector spaces. As an additional point, also notice that quite often curvilinear coordinates in physical problems have 
different physical units! 
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and x = rf,. In cylindrical coordinates (R,y,z), we have x = (R cos 9, R sing, z), so that 


hre=1, hg=R, hz=1, 

fr = (cosg, sing,0), 

a ene (A.144) 
fy = (— sing, cos g,0), 

f, = e, = (0, 0, 1), 


and x = Rfr + zf,. For the ellipsoidal coordinates (A, ju, v) defined for the unitary ellipsoid 
in Eq. (2.9), some more work is needed. The relevant transformation formulae can be 
established in a very elegant way starting from the formal factorization 


> x} (c — A) - WG -v) 
1)" ae l= me ; (A.145) 


where the function A(t) is given in Eq. (2.11). After multiplication of both members of 
the identity in Eq. (A.145) for A(t), the limits for tT = —ai, —as, and —az of the resulting 
identity show that 


ee (aj + A)(at + w) (af + v) 
(at —a3)(az — a3) 


’ 


2 (aS +A)aZ + waz tv) 
(az — at)(az — a3) 

2a (aj + A)(az + 4) (GZ + v) 
(az — at)(az — a3) 


: (A.146) 


: <v< —as <we< —az < i. Surfaces of constant 4 are confocal ellipsoids, 


where —a 
surfaces of constant jz are confocal hyperboloids of one sheet, and surfaces of constant v 
are confocal hyperboloids of two sheets. It is easy to show that the coordinates are locally 


orthogonal, that the Lamé coefficients are given by!” 


,, 1 /@-wa-y 
ae AA)” 


A(H) 2 —A(w) 
(v-—A)\v—p) 1 f/A-v)(u-v) 
A(v) ~9 A(v) 


12 Th the expression for /,,, the positivity of the arguments of the square root has been made apparent (see also Footnote | in 
Chapter 2). 
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and finally that the positively oriented local basis is (f,,f,,f,). Note that from Eq. (A.146) 
it follows that ellipsoidal coordinates are degenerate; in other words, at a given triplet 
(A, u,v), there correspond eight points with reflecting symmetry with respect the three 
coordinate planes of $7, a property with obvious consequences when computing surface or 
volume integrals in ellipsoidal coordinates (e.g., see Exercise 2.5). 

We are now in a position to generalize to the case of orthogonal curvilinear coordinates 
the basic differential and integral operators introduced in Appendix A.4. As a preparatory 
consideration, notice that for an arbitrary vector field g, the following obvious geometric 
relations hold: 


Z= giej= gifj, > 8; = (g,f;) = gi(e:,f;). (A.148) 


Therefore, from Eq. (A.107), it follows immediately that in the local f; basis (no summing 
over repeated indices is intended) 


(grad f); = (Vf) = ——, i =1,2,3. (A.149) 


We then generalize the path integral for a vector field g, such as that appearing in 
Eq. (A.109), as follows. We construct a regular curve y by considering the parameterization 
q(t) with t € [a,b], and then x[q(t)], and it follows immediately that 


3 b dqj 
[isan = ~ gihj edt. (A.150) 
Va j=l a 


A special (and relevant) case is for y coincident with a coordinate curve when, without 
loss of generality, we can assume t = gq; (say) for some i, and the path integral reduces to 
ia g; h,dq; (no summing over i intended). 

In order to generalize flux (and surface) integrals, such as that in Eq. (A.110), we 
consider the parameterization of a regular surface © in {? as q = q(u,v) with u € [a,b] 
and v € [c,d], and then x[q(u, v)]. Therefore, from Eq. (A.111) 


» wf’ [" agi 94) 
sn)d-x = fA £;)hjh; — —dudv. A.151 
[em Lf fe inch OE ay (A.151) 
Notice that from the second identity in Eq. (A.142), fori # j, the wedge product f; Af; = 
+f;,, with the sign determined by the parity of the (i, j,k). Again, a special and important 


case is obtained when the integration surface is coincident with some region of qx = const. 
so that for positive orientation 


n=f, d?x = hjhjdqidq;, (A.152) 


and so 


b pd 
[remax [ / gy hih jdqidq;. (A.153) 
Ss a Jc 
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If we now integrate over a small region on the surface and we use the Stokes theorem in 
Eq. (A.116), it is a nice exercise to show from Eqs. (A.150) and (A.151) that 


1 (agihs ag 
(rot gy), = oS (A.154) 
hoh3 \ dq2 0q3 


(and, correspondingly, making a cyclic exchange of the indices for the directions f2 and f3). 

We finally consider the volume integral. By changing coordinates from x to q, the 
determinant of the Jacobian is given by the triple product of the vectors in Eq. (A.139), 
so that in our case 


|det J| = | det(x,,x2,x3)| = hy hzh3| det(f,, fo, f3)| = hyhohs, (A.155) 


and in particular Eq. (A.97) follows. Considering now Eggs. (A.113) and (A.153) for a small 
volume limited by the three coordinate surfaces, we obtain (prove it!) 


1 pS x dgshyh3 i ey 


divg = 
hyhoh3 oq 0q2 0q3 


(A.156) 


and finally the expression for the Laplace operator is obtained by combining Eqs. (A.149) 
and (A.156) with g = grad f as follows: 


1 ad (hoh3 0 0 (hyh3 0 ad (hyh2 0 
Ayre (= ae (= r)4 (= F)]. casn 
hihzh3 Ldqi \ hi Ogi dq2 \ ho 0q2 0q3 \ h3 0q3 
With this expression, we are now in a position to use the Poisson equation in the coordinate 
systems adopted in stellar dynamics (see Chapter 2). 


A.9 The “Co-Area’”’ Theorem 


We used the co-area theorem in Chapter 2 when discussing the gravitational potential pro- 
duced by a triaxial ellipsoid with density stratified on homologous concentric and coaxial 
ellipsoidal surfaces described by Eq. (2.8). In particular, we faced the quite unintuitive 
fact that the spatial density can be interpreted as the sum of surface densities that are not 
constant over each ellipsoidal stratum. The co-area theorem not only explains why this 
happens, but also provides a general volume integration formula for stratified functions; 
technically, it can be viewed as a powerful change of the integration variable, and in 
the following we illustrate the qualitative idea behind the method. In practice (and for 
simplicity), suppose we have to integrate the product of a generic function f(x) for 9t? to 
ti and a function g stratified on some family of regular surfaces 


S(x) =m, (A.158) 


in analogy with Eq. (2.8). We search for an alternative expression of the volume integral 


T(m,m2) = | f®gIS@W]d3x. (A.159) 
m,<S(x)<m2 
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S(x + nd&) =m + dm 


S(x) =m+dm 


Figure A.2 Qualitative illustration of the geometric idea behind the co-area theorem. 


The idea is to split the integration first by conducting a two-dimensional integration over the 
surfaces of constant m, and then integrating over the range of m. As is shown in Figure A.2, 
we can imagine first introducing for each surface two coordinates €; and &2, as well as a 
coordinate €3 perpendicular to the tangent plane, so that the three coordinates (&), 2, &3) 
produce a local coordinate system. We must now substitute the variable €3 with the variable 
m, and this is accomplished by the requirement that the “length” 5&3 is chosen so that, 
qualitatively speaking, 5&| x 5&6&3 is the “volume” of the infinitesimal parallelepiped with 
a vertex at (€,&) on the surface S(x) = m, with lengths of the tangent basis vectors, and 
with height 6&3 coincident with the distance along the normal to the surface S(x) = m+ém. 
It is elementary to show that we are looking for a solution of the following identity: 


S(x)=m, S(x+nd&)=m+é6m, n= mi (A.160) 
IVSOOI |say=m 

Expanding to the linear order for vanishingly small 5m 

68 = ee meen (A.161) 
IVS) lscj=m 
so that finally 
ma f) 

T(m1,m2) = [ g(m)dm = VSI d°x, (A.162) 


where the appearance of “effective surface density,’ due to the fact that the two surfaces 
in Eq. (A.160) are in general not equidistant, is clear.!? The student should compare the 


13 For the student more interested in rigor, the formal approach would be to determine first the two-dimensional surface element 


a2x = hyhyl||fy A fo |\dé\ dé, (A.163) 
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obtained expression with Eq. (2.15), where the identification of g(m) = o(m) and f(x) = 
\|x—y||~! is apparent. This important result is not only at the basis of the computation of the 
gravitational potential of material ellipsoids and of the electrostatic potential of ellipsoidal 
conductors (elucidating why at the equilibrium charges accumulate in the regions of short 
curvature radius, where the surfaces m and m + 6m are more distant, and so more charges 
contribute to the local effective charge density); it also appears in statistical mechanics 
when computing the density of states over the constant-energy surfaces for microcanonical 
ensembles, with the well-known proportionality factor given by the inverse of the norm of 
the gradient of the Hamiltonian (e.g., see Khinchin 1949). 


A.10 Transport Theorems 


The mathematical results generally known as transport theorems plays a fundamental role 
in different areas of physics, ranging from fluid dynamics, to electrodynamics, to statistical 
mechanics; for example, the well-known Reynolds’ transport theorem, Kelvin’s transport 
theorem, and the transport integrals appearing in the Ampere and Faraday—Maxwell 
equations in integral form all belong to this family. In particular, in Chapter 9, Reynolds’ 
transport theorem has been applied to derive the collisionless Boltzmann equation in 
curvilinear coordinates. Transport theorems are important because they describe how some 
property of a “fluid” (in the broadest meaning) integrated over some region (volume, 
surface, curve) changes due to the “transport” produced by the solution of the evolutionary 
differential equation describing the system. The advantages of integral formulation over a 
differential formulation are obvious because the value of a given integral is independent 
of the specific coordinates used, and so transport theorems are natural tools to formalize 
physical principles and to obtain the corresponding differential equations. In this section, 
we will focus on the main logical steps behind the derivation and meaning of Reynolds’ 
transport theorem (for more extended discussions, see, among others, Aris 1989; Borisenko 
and Tarapov 1979; Currie 1993; Jackson 1998; Lamb 1945; Meyer 1982; Milne-Thomson 
1996; Narashiman 1993). 


where no assumptions about the orthogonality of fj and f2 are made. We then ask that the €3 coordinate, measured 
perpendicular to the surface, is the parameter m; from Eq. (A.158) 


a 
l= (vse, =| = hm (VS(x),f£m) = hm|IVS | > hm = (A.164) 


1 
VSI 
Therefore, the volume element becomes 


d2xdm 


———,, A.165 
IvSi ne 


a>x = hyhyhm det(fy.£o,f,.) = 


and this concludes the proof. 
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A.10.1 Ordinary Differential Equations 


In order to introduce the subject, we recall some basic facts about the general properties 
of ODEs in it” (e.g., see Arnold 1992; Ince 1927; Lomen and Mark 1988; Wintner 1947). 
As the discussion is quite general, in the following, by z we mean a list of n variables (not 
necessarily a vector). For example, in the simplest case z = x, the Cartesian coordinate 
vector in 3°, or fora point moving under the action of a force field in Cartesian coordinates, 
we could have z = (x,v), or Z = (q,p) when considering the motion in phase space 
induced by some Hamiltonian function, or z = (q, q) for the case of curvilinear coordinates 
and generalized velocities in the Euler-Lagrange framework, or finally z = (q,v) when 
decomposing the velocities over the local basis as in Chapter 9. 

Therefore, let A C St” be an open set and W(z) be a list of n functions A +> {”. The 
system 


: dz 
Z= ae = W(z), z(0) = Zo, (A.166) 


is called an (autonomous) ODE in it” with initial condition z 9; in most physical applica- 
tions, the parameter f is the time, and the functions W could also depend explicitly on time, 
but for simplicity in the following we restrict ourselves to the autonomous case. A function 
W:AxI+t> Xt”, where J is some (open) interval spanned by the parameter f, so that 


W(zo,t) = WI (zo, t)], WV (zo,0) = 20, (A.167) 


is called the solution with initial condition zo for the ODE. 

A central problem of the theory of ODEs is to determine whether a solution exists and 
what are its properties. For our purposes, the most important results concern the local 
existence, uniqueness, and regularity of the solution (details and formal proofs can be found 
in the list of references provided above). Regarding existence, if the components of W are 
at least continuous on some open set Uo ¢€ A, then the solution exists at least locally (i.e., 
ad > 0 exists for which Eq. (A.167) holds for |t| < 6). Moreover, if W is Lipschitz on 
Uo, then the solution not only exists, but it is also unique, with continuous dependence on 
the initial condition zg. Finally, if the components of W are smooth of order C’ (Up), with 
r > 1, then W(zo,t) € C’(Up) with respect to z) and W(z9,t) € C’+!(1) with respect 
to t. In summary, for sufficiently regular W, the three results just mentioned imply that the 
solution is time-invertible; in other words, for |t| < 6 and Vz) € Uo 


Z=WV(z,t), Zo = V[V(zo,t), — ft]. (A.168) 


With suggestive notation, we can write!4* w-!(z,t) = Wz, — £). 


!4 Technically, W is a one-parameter diffeomorphism, with W[W (zg, t)),to] = V(Zo,ty + tz) for arbitrary t; and ty; see Arnold 
(1992). 
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In practice, the previous results show that for arbitrary but fixed ¢, the components of V 
determine a regular change of variables between z and zo. Let Jo(t) be the determinant of 
the Jacobian matrix of the coordinate transformation z = W (Zo, t) 


OW; (Zo, ¢ 
Jo(t) = dor | CO? | aos ers (A.169) 
020; 

From uniqueness, Jo 4 0 over Uo x J, so that from the identity Jo(0) = 1, it follows that 
Jo(t) remains positive for t € J. We now prove the fundamental identity 
dJo "aw; 
ee 
dt we 3B OZj 

i=l Z=WV (Zo, 1) 


(A.170) 


In fact, from linear algebra, the derivative of the determinant of ann x n matrix with respect 
to some parameter equals the sum of determinants, where in each of them a different row 
(column) of the matrix is replaced by the derivative of the row (column). Equation (A.170) 
can be proved as follows: let us compute the first of our determinants associated with 
dJo/dt. For the j-th element in the first row, we have 


n 


d ow, a dV _ dW [WY (Zo, t)] = OW, OW, 3 OW, OW; 


dt dz; dz; dt deo; OW; dz; aW; 920; 


(A.171) 
i=2 
In practice, the first row of the first determinant is given by the first row of the matrix in 
Eq. (A.169) multiplied by dW, /0W;, plus the linear combination of the rows i = 2,...,n 
with coefficients dW; /dW;. The resulting determinant is independent of such a linear 
combination, and it sums to JodW ,/dW,. Equation (A.170) is then proved by repeating 
the same calculation for the remaining n — 1 determinants and adding the results. 


A.10.2 Material Derivative 


A key concept associated with the solution of the ODE in Eq. (A.167), and central in the 
following discussion, is built by considering the time evolution of some given function 
f = f(t) when computed along the solution with initial condition zp. The quantity 


fc (Zo,t) = f[V (Zo; t),t] (A.172) 


is called the Lagrangian function associated with the initial condition Zp. From a physical 
point of view, fr describes how the “property” f associated with zo at t = 0, evolves with 
time. 

The material (or Lagrangian) derivative of f along the solutions of W with initial 
condition Zo is immediately obtained from the standard differentiation rule of composite 
functions 


(A.173) 


dfc(zo,t) _ Df Df _ of +o of 
dt Dt |,-weyt) Dt at ez; 
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where the differential operator D/ Dt is the material derivative in Eulerian form. Of course, 
the material derivative obeys the same rules of standard total derivatives when considering 
sums and products of functions, etc. In Chapters 9 and 10, examples of material derivatives 
of astrophysical interest are discussed. Notice that a quantity is conserved along a solution 
if and only if its material derivative vanishes. 


A.10.3 Reynolds’ Transport Theorem 


We are finally in a position to consider an arbitrary region Q(0) C Mt” at t = O. The 
solution V (zo; t) induced by W “moves” each point of (2(0), transforming (2 (0) into Q(t). 
The function 


F(t)= f(,t)d"2 = / fc(to,t) Jo(t)d" 29 (A.174) 
Q(t) (0) 


is the integral of f over the region Q(t). We wish to determine the explicit expression of 
dF /dt. The method is based on the possibility of changing variables from z back to zo from 
Eq. (A.168), where Jo > 0 is given by Eq. (A.169) and the integration domain is mapped 
back to A(Q) and so is time-independent. From differentiation under the sign of the integral 


= =1 4 f Jo(t)d" zo 
dt 20) \ Dt dX a eee 
Df = OW; 
_ ans ns a"z, A175 
[,., Dt - d, OZi 


where we used the first identity in Eq. (A.173) and Eq. (A.170), and the last expression!> 
is obtained by reverting the coordinates from Zo to z. 


15. Two other relevant transport theorems (e.g., see Aris 1989; Jackson 1998; and in particular Truesdell 1954) that are often 
encountered in mechanics, fluid dynamics, and electrodynamics are related to the transport of the flux of a vector field in om 
through a time-dependent surface © (t) 


d 2 dg ; 2 
— (g,n)d-x + W div g + rot (g A W) | ,n)d°x 
dt Je) Zc) AL OF 


[ [32 + gdivW ce. vW] dx, (A.176) 
x(t) \L Dt 


where the equivalence of the two formulations can be established by using Eq. (A.125), and the second expression is obtained 
from the result (that we do not prove here) 


D aw; 
pie» = [civ wom = ae, nj |x (A.177) 
if | 


and to the transport of circuitation of a vector field (sometimes known as Kelvin’s theorem when applied to the fluid velocity 
u = W) evaluated along a time-dependent curve y(t) 


d | Dg b dWe 
= (g,dx) = [ ( vax) i Be. da. (A.178) 
dt Jy) y(t) \ Dt a Loa 


In particular, for g = u, the identity in Eq. (A.178) in the case of an inviscid, barotropic fluid under the action of conservative 
forces as obtained by specializing Eq. (10.23) shows that the velocity circuitation is conserved along the fluid motion, and so 
under these assumptions and the Stokes theorem, the fluid vorticity @ = rotu cannot be created or destroyed. 
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A first important application of the previous result is obtained under the assumption 
that dF (t)/dt = 0 for all possible choices of the control volume (2 (0), a case that is often 
encountered in physics. In this, the transport theorem is used to deduce the equation obeyed 
by the integrand, and we get 


we oT = 74 > a =0, = (A.179) 


where the second expression is obtained by using Eq. (A.173). In fact, if the integrand in 
the last integral of Eq. (A.175) would be different from 0 at some point (and so over some 
small region around the point for a well-known theorem from calculus), by restricting (2 (0) 
to such a region we would obtain a nonzero integral, which goes against the assumption. If 
in addition W is solenoidal, we have 


OW; Df 
a = ae (A.180) 
A second application of the transport theorem is obtained when we know something 
about the integrand in Eq. (A.175), and we use this information to deduce the behavior 
of dF (t)/dt. One preeminent example is given by the Liouville (1809-1882) theorem: 
the fact that the volume in phase space of a Hamiltonian system is conserved. In fact, fix 
Ff = 1 in Eq. (A.174), so that F(t) = Volume[Q2(t)], and then use Eq. (A.175). We deduce 
immediately that the volume contracts in cases of the motion produced by a field W having 
negative divergence and expands if the divergence is positive. As special case is that of 
solenoidal velocity fields (e.g., Hamiltonian flows in phase space): for all Hamiltonian 
fields, the phase-space volume is conserved, and so Eq. (A.180) holds. Moreover, the same 
equation holds for all canonical changes of coordinates, as they leave invariant the structure 
of Hamilton equations, and the Jacobian matrix of the transformation is symplectic, so that 
its determinant is identically 1 (see Chapter 9). 
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Jacobi’s integral, 301 
Jacobi—Lie integrability, 162 
Jaffe model, 245 

Jeans equations, 172, 174 
Jeans theorem, 161, 166 


Kelvin theorem, 331 

Kelvin time, 116 

Kinetic energy tensors, 185 
King models, 226 

King’s photometric profile, 243 
King’s sphere, 65, 244, 270 
Kormendy law, 118 

Kuzmin disk, 251, 255, 272 
Kuzmin’s theorem, 252 


Lagrange libration points, 302 
Lagrange—Jacobi identity, 107, 120 
Lamé coefficients, 167, 323 
Lane—Emden equation, 225 

Laplace equation, 18 

Laplace theorem, 110 
Laplace—Runge-—Lenz vector, 78, 79 
Laplacian, 17, 30, 318 

Legendre associate functions, 32, 308 
Legendre functions, 308 

Legendre integral transform, 43 
Legendre polynomials, 34, 308 
Levi—Civita tensor, 299 

Libration, 56 

Lie algebra, 165, 276 

Lie—Jacobi integrability, 78 


346 Index 


Line of sight, 197 

Line profile, 203 

Liouville equation, 153 

Liouville theorem, 153, 157, 158, 332 

Logarithmic density slope, 290 

Logarithmic integral, 306 

Los projected streaming velocity field, 201 

Los projected velocity dispersion field of order N, 201 
Los projected velocity field of order NV, 200 

Los velocity dispersion field of order N, 201 


Maclaurin disk, 97, 272 

Macroscopic function, 151, 153, 154, 173 
Macrostate, 151 

Master equation, 160 

Material derivative, 175, 330 
Maxwell—Boltzmann, 133 
Maxwell—Boltzmann distribution, 135 
Mean free path, 10 

Merging of galaxies, 117 

Meridional plane, 84 

Mestel disk, 46, 95, 96, 272 

Method of moments, 172 

Michie models, 226 

Microscopic function, 173 
Microstate, 151 

Mixed product, 299 

Miyamoto—Nagai model, 250, 255, 261, 272 
Modified Bessel functions, 45, 311 
MOND, 47 

Monopole potential, 29 

Movable singularities, 108 

Multipole expansion, 28 


Navarro—Frenk—White profile, 245, 270 
Neumann’s problem, 18 

Newton’s first theorem, 5 

Newton’s second theorem, 5 

Newton’s third theorem, 20 

Newtonian fluids, 177 

Noether’s theorem, 78, 166 

Norm, 296 


Oort’s constants, 90, 93 

Orbital eccentricity, 79 

Orbital family, 84 

Orbital precession, 88, 101 

Osipkov—Merritt models, 220, 256 
Ostriker—Peebles stability criterion, 114, 280 


p-Laplacian, 47 

Perfect ellipsoid, 41, 65, 244, 270 

Pericenter, 81, 87 

Phase mixing, 114 

Phase-space distribution function, 125 

Phase-space velocity dispersion field of order N, 201 


Phase-space velocity field of order NV, 200 
Planar solution, 109 

Plummer model, 225, 244, 254, 270, 274, 294 
Poisson bracket, 157, 165 

Poisson equation, 17, 217 

Polylogarithm, 306 

Polytropic index, 225 

Power-law models, 243, 257 

Projected luminosity, 239 

Projected mass, 239 

Projected mass density, 238 

Projected streaming velocity field, 200 
Projected velocity dispersion field of order N, 201 
Projected velocity field of order N, 200 
Projected virial theorem, 205 

Proper motion, 94 

Pseudo-collisions, 108 


Quadrupole potential, 29 
Quadrupole tensor, 29, 41 
Quasi-isothermal sphere, 244 


r— force, 11, 48, 82, 120, 192 
y-models, 40, 65, 245, 270 
Radial migration, 77 

Radial orbit instability, 114, 269, 280 
Rayleigh criterion, 87 
Reciprocity theorem, 188 
Rectilinear solution, 109 
Reduced mass, 72, 127 
Relative energy, 212 

Relative equilibrium, 109 
Relative orbit, 71 

Relative potential, 212 
Relative velocity, 71 

Reynolds’ transport theorem, 176, 328 
Riesz potential, 48, 135 

Roche limit, 68 

Roche potential, 302 
Rodrigues identity, 308 
Rosenbluth potential, 131, 136 
rot, 14 

Rotation curve, 94 


Satoh decomposition, 260 

Satoh models, 250 

Scaling laws, 117 

Second quantum number, 32 

Semilatus rectum, 79 

Separation of variables, 31 

Sersic profile, 241, 271 

Shear tensor, 177 

Sine integral, 306 

Singular isothermal ellipsoid, 193 

Singular isothermal sphere, 36, 90, 99, 233, 
243, 275 


Index 347 


Slingshot effect, 70, 74 Tully—Fisher, 86 

Solenoidal field, 317 Two-body relaxation time, 126 
Spherical harmonics, 32 

Spiral structure, 90 Ultraviolet divergence, 130 
Stickel models, 25, 251 Unicity theorem, 18, 21 
Stellar polytropes, 224 

Stirling expansion, 304 Vector product, 297 

Stokes’ theorem, 14, 316 Velocity dispersion ellipsoid, 173 
Streaming velocity, 173 Velocity dispersion tensor, 173 
Stress tensor, 177 Velocity moments, 173 

Strip brightness, 239, 241 Velocity profile, 203 

Strong homology, 268 Velocity section, 213 

Struve functions, 47 Versor, 297 

Sturm—Liouville theorem, 309 Violent relaxation, 113 
Sundman’s inequality, 107, 121 Virial plane, 114 
Superposition principle, 4 Virial radius, 112 

Surface brightness profile, 238 Virial theorem, 111 

Syzygial solution, 109 Virial velocity dispersion, 112 


Vlasov equation, 155 
Tensor virial theorem, 185 


Tidal disruption, 68 Wave vector, 313 

Tidal field, 49 Weak homology, 268 

Tidal potential, 50 Weierstrass-Sundman theorem, 109, 122 
Tidal tensor, 50 Wilson models, 226 

Tidal torque, 55 Wronskian, 309 

Toomre stability criterion, 114, 280 

True anomaly, 79 Yukawa potential, 99 


Truncated constant-density disk, 97, 272 
Truncation radius, 238 Zero-velocity curve, 85, 87 


